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1^. (57) Abstract: The present invention relates to the generation and use of a cellular array or a cellular array in combination with 
other genome-registered arrays (an array of arrays) for the determination of gene function and/or perturbation mode of action. Each 
cellular array consists of a number of microbial strains. Each strain comprises one reporter gene fusion made up of a gene or gene 
fragment opcrably linked to a reporter gene. Each gene or gene fragment has been "registered" or mapped to a specific location in the 
genome of the organism. The genome-registered collection of the invention may be used to determine alterations in gene expression 
under a variety of conditions. Such collections are amenable to rapid assay and may be used to confirm, correct or augment data 
generated from DNA micro array technology. 
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TITLE 

CELLULAR ARRAYS FOR THE IDENTIFICATION OF ALTERED GENE 

EXPRESSION 

This application claims the benefit of U.S. Provisional Application No. 60/197,348 
filed April 14,2000. 

FIELD OF THE INVENTION 
This invention is in the field of bacterial gene expression. More specifically, this 
invention describes a method to monitor transcriptional changes on a genome-wide scale 
using a genome-registered gene fusion collection. 

BACKGROUND OF THE INVENTION 
DNA array analysis is a powerful method for comprehensive genome analysis of 
gene expression. Currently, this approach is the only available method for massively parallel 
analyses that allow the expression of each gene of a bacterial genome to be characterized 
simultaneously (Richmond etal., (1999) Nucleic Acids Res. 27:3821-3835., 17, 25; Tao 
et al., (1999), J. Bacteriol. 181:6425-6440;Wilson et al., (1999) Proc. Natl. Acad. Sci. U. S. 
^.96:12833-12838). 

Richmond et al. ((1999) Nucleic Acids Research, 27:3821-3835) has recently 
reported genome-wide expression profiling of E. coli at the single ORF level of resolution. 
Changes in RNA levels after exposure to heat shock or IPTG were analyzed using 
comprehensive low density blots of individual ORFs on a nylon matrix and comprehensive 
high density arrays of individual ORFs spotted on glass slides. The results of the two 
methods were compared. Richmond et al. states that radioactive probe/spot blots are inferior 
to fluorescent probe/micro-arrays. Moreover, the comparison of heat shock treatment 
between the two methods is fundamentally flawed since the RNA analyzed with spot blots 
were derived from broth grown cultures while those analyzed with micro-arrays were 
derived from cells grown in defined media. Despite the power of this new methodology, 
there are several problems that limit the reliability of results. For example, artifacts may 
arise during the isolation of microbial RNA (Tao et al., (1 999) J. Bacteriol. 1 8 1 :6425-6440) 
or from cross hybridization to paralogous genes (Richmond et al., (1 999) Nucleic Acids Res 
27:3821-3835, 17,25). 

Another limitation of DNA array methodology is that RNA must be isolated, 
converted into DNA by reverse transcriptase with concomitant incorporation of fluorescent 
labels. These steps make it unlikely that facile high throughput screens could be developed 
based on DNA array technology. Thus, there exists a need for a method that adapts results 
from DNA array technology into high throughput screens. For the reasons mentioned above 
and others, alternative genome-wide expression profiling method as well as rapid methods to 
independently verify results from DNA array experiments are needed. 
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Gene fusion technology is an established method for gene expression monitoring. 
For example, the initial discovery of the SOS (DNA damage responsive) regulon of E. coli 
was done by Kenyon and Walker ((1980) Proc. Natl. Acad. Sci. U. S. A. 77:2819-2823) by 
comparing the transcriptional responses of Escherichia coli to mitomycin C (MMC), a DNA 
5 damaging agent that intercalates into and forms a covalent attachment with double-stranded 
DNA. While these early experiments attempted to scan the bulk of the E. coli genome by 
using a transposon that put the lacZYA operon under the control of many promoter regions, it 
was not known if the entire genome had been surveyed because of the random nature of 
transposition and unknown location of the majority of transposition events. Accordingly, 

10 additional SOS regulon genes have been identified since these early experiments (Lomba 
et aL, (1997) Microbiol Lett 156:1 19-122; Walker, (1996) In Escherichia coli and 
Salmonella: Cellular and Molecular Biology. ASM Press, pp 1400 1416). 

LaRossa et aL (U.S. 5,683,868) has transformed E. coli with at construct comprised 
of luxCDABE operably linked to a variety of stress promoters. They have used the 

15 microorganisms to detect a variety of environmental insults such as Ethanol, CdCl 2 and 
toluene. The presence of sublethal concentration of insults is indicated by an increase in 
bioluminescence. However, in order to generate the transformed host, the stress promoters 
has to be identified and characterized. Furthermore, this method is limited to the stress 
response only. 

20 Ashby and Rine (U.S. 5,569,588) reported a method to measure the transcriptional 

responsiveness of an organism to a candidate drug by detecting reporter gene product signals 
from separately isolated cells of a target organism on genome-wide bases. Each cell 
contains a recombinant construct with the reporter gene operatively linked to a different 
endogenous transcriptional regulatory element of the target organism When cells were 

25 treated with a candidate drug, the transcriptional responsiveness of the organism to the 

candidate drug was measured by the detecting the reporter signal from each cells. However, 
this method is useful with the organism only after the majority of transcriptional regulatory 
elements of the target organism are known and mapped. Furthermore, the reporter signals 
are measured only after cells reached homeostasis in the presence of drug. The initial 

30 transcriptional responses to chemicals are not considered. 

The Lux-A Collection of random E. coli genomic DNA fused to the luxCDABE had 
been used to screen for those gene fusions for which expression was induced by treatment 
with the herbicide sulfometuron methyl. The DNA sequence of 19 of these sulfometuron 
methyl inducible gene fusions {smi-lux) was determined and used to identify the promoter 

35 controlling expression of the luxCDABE reporter (Van Dyk et al., (1 998) J. Bacteriol. 
1 80:785-792); the remaining 8047 gene fusions remained unidentified. 

LaRossa and Van Dyk (U.S. 6,025, 131) developed a method for the identification of 
gene regulatory regions, responsive to a particular cellular stress, such as that produced by 
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herbicides or crop protection chemicals by randomly fusing regulatory regions to a bacterial 
luminescent gene complex where contacting the fusion in a suitable host with a cellular 
insult producing a cellular stress results in detection of that cellular stress by an increase in 
cellular luminescence. However, this method was limited to the perturbations in liquid 
5 media; luminescent responses were not detected on solid medium following overnight 

growth in the presence of a chemical stress. Furthermore, it did not allow regulatory region 
activity analysis in genome-wide scale. 

The problem to be solved therefore is to provide a way to measure and follow the 
changes in gene expression using a genome-registered collection of reporter gene fusions in a 
10 manner that allows detection of initial transcriptional responses, and provide a way to cross- 
validate the results from other method (i.e., microarray) as well as to determine promoter and 
operon structure of genes, and further provide a way to test cellular responses to various 
environmental and genetic changes in high throughput manner. 

SUMMARY OF THE INVENTION 
A new method for the use of genome registered collection of reporter gene fusion is 
disclosed. Fragments of genomic DNA of host organism were fused to promoterless reporter 
gene. The reporter gene fusions were generated using restriction enzyme digestion, physical 
shearing of the genomic DNA, PCR, and transposition techniques. The reporter gene 
complexes were genome registered against the host genome on the basis of homology. Gene 
expression of each reporter gene complex is measure as reporter gene activity. The present 
invention provides a means to measure the changes in gene expression profiles in genome 
wide scale under various conditions in high throughput manner. In addition to being a stand- 
alone high throughput method, the present invention also provides a way to validate other 
genome-wide assays such as DNA microarray. The present invention also provides a 
method to confirm the response of several promoters to a particular insult (a condition or 
chemical of interest) as well as to identify a number of previously unknown operons 
responsive to that insult. Comparison of the gene expression patterns of two samples 
differing in one variable is also possible using this method. The present invention also 
provides the method to use an array of arrays by generating gene expression profiles that 
30 yield information relevant to understanding gene function and modes of chemical action. 
Such information can be gained by analysis of genetic alterations resulting in loss of 
function, reduced levels of gene products, or over-expression of gene products. Thus, an 
array of arrays can be used to enhance both mode of action studies and functional genomics. 

In this invention, the sequencing and genome-registering of the majority of the 
Lux-A Collection members were completed. Lux fusions in the Lux-A Collection were 
fragments of E. coli genomic DNA fused to promoterless luxCDABE gene. The genome- 
registered collection of lux fusions were examined for the biological responses measured by 
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changes in bio luminescence. The present invention provides a means to measure the 
changes in gene expression profiles under various conditions. 

In addition to being a stand-alone high throughput method, the present invention also 
provides the way to validate or detect false-positive result from other genome wide assays 
5 such as microarray. 

The present invention provides a method to confirm the response of several 
promoters to a particular insult as well as to identify a number of previously unknown 
operons responsive to that insult. 

The present invention provides a method for comparing the gene expression patterns 
10 of two samples differing in one variable. The variables may include but not limited to 
genotype, media, temperature, depletion or addition of nutrient, addition of an inhibitor, 
physical assault,, biological assaults, irradiation, heat, cold, elevated or lowered pressure, 
desiccation, low or high ionic strength, and growth phases. 

The present invention also provides the method to use an "array of arrays" by 
15 generating gene expression profiles that yield information relevant to understanding gene 
function and modes of chemical action. Such information can be gained by analysis of 
genetic alterations resulting in loss of function, reduced levels of gene products, or over- 
expression of gene products. Thus, an array of arrays can be used to enhance both modes of 
action studies and functional genomics. 
20 Thus the invention provides a method for identifying altered gene expression 

between at least two genome-registered collections comprising: 

(a) assembling at least two genome-wide scale,; genome-registered collections; 

(b) perturbing each collection from (a) with at least one perturbation; 

(c) measuring the response of each collection to each perturbation of (b); 
25 (d) analyzing the results of the at least one perturbation to identify genetic 

differences between the at least two genome-registered collections. 

Additionally the invention provides a method for generating a genome-registered 
collection of reporter gene fusions comprising the steps of: 

(a) generating a set of gene fusions comprising: 

30 1) a reporter gene or reporter gene complex operably linked to 

2) a genomic fragment from an organism of which at least 1 5% of the 
genomic nucleotide sequence is known; 

(b) introducing in vitro the reporter gene fusions from step (a) into a host organism; 

(c) registering the reporter gene fusions on the basis of sequence homology to the 
35 genomic sequence of the organism; 

(d) repeating (a), (b), and/or (c) until reporter gene fusions have been made to at least 
1 5% of the known genomic nucleotide sequence of said organism. 
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Similarly the invention provides a method for generating a genome-registered 
collection of reporter gene fusions comprising: 

(a) generating random nucleic acid fragments from the DNA of an organism of 
which at least 15% of the nucleotide sequence is known; 
5 (b) operably linking the random nucleic acid fragments generated in (a) to a vector 

containing a promoterless reporter gene or reporter gene complex; 

(c) introducing the vector (b) containing the gene fusions into a host organism; 

(d) determining the nucleic acid sequence of the distal and the proximal ends of the 
random nucleic fragments relative to the reporter gene or reporter gene complex; 

10 (e) registering the sequenced fusions of step (d) on the basis of sequence homology 

to the genomic sequence of the host organism; 

(d) repeating (a), (b), and/or (c) until reporter gene fusions have been made to at least 
15% of the known genomic nucleotide sequence of said organism. Generation of the random 
nucleic acid fragments of step may incorporate restriction enzyme digestion, physical 
1 5 shearing of the genome and polymerase chain reaction. 

In another embodiment the invention provides a method for generating a genome- 
registered collection of reporter gene fusions comprising steps of: 

(a) introducing one or more transposons into the genome of an organism of which at 
least 1 5% of the nucleotide sequence is known, each transposon containing a promoterless 

20 reporter gene or reporter gene complex; 

(b) determining the nucleic acid sequence of the junction between the proximal end 
of the genomic DNA and the transposon containing the reporter gene or reporter gene 
complex and registering the reporter gene fusions relative to the genomic sequence of the 
organism, 

25 ( C ) repeating (a) and (b) until reporter gene fusions have been made to at least 1 5% 

of the known genomic nucleotide sequence of said organism. 

Alternatively the invention provides a method for identifying a profile of inducing 
conditions for a reporter gene fusion comprising: 

(a) obtaining a gene expression profile of an organism under induced and non- 
30 induced conditions wherein induced genes are identified; 

(b) providing a genome-registered collection of reporter gene fusions, said fusions 
registered to the genome of the organism of (a); 

(c) selecting the reporter gene fusions of (b) that correspond to the induced genes of 
(a) to create a subset of the genome-register collection; 

(d) contacting the subset of the genome-register collection of (c) with the inducing 
conditions of (a) to identify at least one representative reporter gene fusion whose expression 
was altered in a similar manner as in (a); 
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(e) contacting the at least one representative reporter gene fusion of (d) in a high 
throughput manner with a multiplicity of different inducing conditions to identify a profile of 
inducing conditions for that reporter gene fusion. 

In another embodiment the invention provides a method for generating a genome- 
5 registered collection of reporter gene fusions comprising: 

(a) providing a genome from an organism wherein at least 1 5% of the nucleotide 
sequence is known; 

(b) providing a series of amplification primers having homology to specific known 
regions of the genome of (a) ; 

10 (c) amplifying portions of the genome of (a) with the primers of (b) to create a 

collection of nucleic acid amplification products; 

(d) operably linking the amplification products of (c) to a vector containing a 
promoterless reporter gene or reporter gene complex; 

(e) introducing the reporter gene fusions into a said organism; 

15 (f) repeating (a) - (e) until, until reporter gene fusions have been made to at least 

1 5% of the known genomic nucleotide sequence of said organism. 

In another embodiment the invention provides a method for analyzing a gene 
expression-based high throughput assay comprising: 

(a) analyzing a genome-wide gene expression assay of a microbial culture with a 
20 genotype of interest relative to that of a congenic strain to identify genes whose expression 

differs as a function of the allele; 

(b) selecting corresponding reporter gene fusions containing promoter regions 
operably linked to the genes with altered expression from (a) or co-regulated reporter gene 
fusions containing promoter regions operably linked to the genes co-regulated with those 

25 displaying altered expression from (a) from the genome-registered collection of reporter gene 
fusion; 

(c) testing expression of the reporter gene fusions selected from (b) by introducing 
the reporter gene fusions into the congenic strains; 

(d) choosing at least one representative reporter gene fusion whose expression 
30 differed in the two strains tested in (c); 

(e) using the reporter gene fusion as a high throughput screen to find other chemicals 
or genes that alter expression of the fusion. 

Similarly it is an object of the invention to provide a method to validate results from 
comprehensive genome analysis comprising the steps of: 
35 (a) analyzing a genome-wide, gene expression assay of an organism treated with a 

condition or chemical of interest to identify genes with altered expression; 

(b) selecting from a genome-registered collection of reporter gene fusions those 
reporter gene fusions containing promoter regions operably linked to genes corresponding to 
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the altered genes from (a) or genes co-regulated with genes corresponding to the altered 
genes from (a); 

(c) testing expression of the reporter gene fusions selected from (b) with the 
conditions or chemicals of interest used in (a); and 

5 (d) comparing the gene expression results from (c) to the gene expression result of 

(a). 

The invention additionally provides a method to determine operon structure 
comprising steps of: 

(a) selecting a subset of reporter gene fusions from a genome-registered collection of 
10 reporter gene fusions that map to the region of a possible operon; 

(b) assaying the subset for the reporter gene function; and 

(c) determining a putative operon structure based on the quantities of reporter gene 
function. 

Alternatively the invention provides a method for constructing a cellular array 
15 containing reporter gene fusions comprising: 

(a) generating a set of gene fusions comprising: 

1) a reporter gene or reporter gene complex operably linked to 

2) a genomic fragment from an organism of which at least 15% of the 
genomic nucleotide sequence is known; 

20 (b) selecting a non-redundant subset of reporter gene fusions from the set of (a) 

representative of at least 15% of known or suspected promoter regions from a genome- 
registered collection of reporter gene fusions, each containing a known or suspected 
promoter region operably linked to a reporter gene or reporter gene complex; and 

(c) fixing the non-redundant subset of reporter gene fusions of (b) in an array format. 
In a preferred embodiment the invention provides a method for measuring gene 
expression responses to perturbation comprising: 

(a) constructing at least 2 identical cellular arrays, each cellular array comprising a 
reporter gene fusion comprising: 

1) a reporter gene or reporter gene complex operably linked to 
30 2 ) a genomic fragment from an organism of which at least 1 5% of the 

genomic nucleotide sequence is known; 
wherein at least one cellular array is a control array and at least one cellular array is 
an experimental array; 

(b) contacting the experimental array of (a) with a perturbing condition; 

(c) comparing the differences between the gene expression activity of the control and 
the experimental array wherein gene expression response to a perturbing condition is 
determined. 
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Organisms amenable to the present method include prokaryotes and fungi and 
particularly enteric bacterium. 

Reporters useful in the present method include luxCDABE, lacZ, gfp, cat, galK, inaZ, 
luc, luxAB, bgaB, nptII,phoA, iridA andxy/J?. 
5 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a diagram of fusions to the lac operon from the Lux-A Collection. The 
direction of the arrowhead indicates if the DNA of the cloned chromosomal segment is 
oriented such that a promoter, if present, would be driving expression of genes on the direct 
(arrowhead to the right) or complementary (arrowhead to the left ) strand. 
10 Figure 2 describes kinetics of induction of increased bioluminescence from E. coli 

strain DPD3232 containing ayebF-hvcCDABE fusion treated with various concentrations of 
MMC. 

Figure 3 (A-C) describes generally the Luxarray 0.5 luminescence during solid phase 
growth. Figure 3A describes the cell placement to replicate 96 well spacing to fit a 
15 microplate luminometer. In the white light on the left, each replicate of the twelve spots is 
outlined. The bioluminescent image of the same plate is shown on the right. Figure 3B 
describes the signal collected for each clone as a function of time. Each cycle was about 
20 min. Figure 3C describes signals collected from 8 replicates of same clone. 

Figure 4 (A-D) describes generally the Luxarray 0.5 perturbation with nalidixic acid 
20 (NA). Figure 4A represents results from strains containing the parental plasmid and 

two non-responding reporters, o^wF, and lacZYA and Figure 4B represents the results from 
three DNA damage responsive reporters, uvrA, recA, dinG. 

Figure 5 describes High density Luxarray 0.5 perturbation with nalidixic acid. The 
squares represent reporter gene response from nalidixic acid treated cultures and the circles 
25 represent reporter gene activity from untreated cultures. 

Figure 6 describes graphical representation of selection criteria implemented to select 
Luxarray 1 .0 Clones. 

Figure 7 describes bioluminescence captured from Luxarray 1.0 reporters after 14 In- 
growth on LB media by cooled CCD array camera (FluorChem 8000, Alphalnnotech). 
30 Figure 8 describes the predicted promoters in a gene cluster of 20 genes for type! 

extracellular polysaccharide in E. coli and the promoter activity of. luxCDABE gene fusions 
from the region encoding production of type I extracellular polysaccharide. The E. coli 
genes in this region are shown with the top set of arrows. Above this line are the predicted 
promoters regions from two sources (P, Blattner et al., (1997) Science 277:1453-1462); P, 
35 Thieffry et al. (1998) Bioinformatics 14:391-400). The two promoters supported by DNA 
array and gene fusion data presented here are shown in bold type. The b designation and 
common name for each gene is shown. The ratios of the deduced mRNA level in the rpoC 
mutant strain to the deduced mRNA level in the rpoC*~ strain determined by the micro array 
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metliod are shown on the next line. The mapped location of the chromosomal inserts of 
selected IvxCDABE gene fusions in this region are shown with the second set of thicker 
arrows. Below these are the lux clone identification number and bioluminescence data 
(RLU/1 09 cfia) for each plasmid in the rpoC + E coli host strain and the host strain carrying 
an rpoC mutation. In the case of overlapping gene fusions, the shorter one is listed first. 

Figure 9 (A-D) represents generally the induction of the promoter bl 728 by nalidixic 
acid (NA) in lexA+ host (Figure 9A) in comparison to the promoter in lexind host 
(Figure 9C), and induction of the promoter £7725 by mitomicyn C (MC) in lexA+ host 
(Figure 9B) in comparison to the promoter in lexindhost (Figure 9D). 

Figure 10 represents uhpT-lux upregulation by limonene. 

The invention can be more fully understood from the following detailed description 
and the accompanying sequence descriptions which form a part of this application. 

DETAILED DF.SCRT PTION OF THE INVENTION 

The present invention relates to the generation and use of a cellular array or a 
cellular array in combination with other genome-registered arrays (an array of arrays) for 
the determination of gene function and/or perturbation mode of action. Each cellular array 
consists of a number of microbial strains. Each strain comprises one reporter gene fusion 
made up of a gene or gene fragment operably linked to a reporter gene. Each gene or gene 
fragment has been "registered" or mapped to a specific location in the genome of the 
organism. Cellular arrays of the present invention are those that contain reporter gene 
fusions to at least 15 % of the genome of the organism being analyzed. Cellular arrays 
containing these reporter gene fusions are referred to herein as "registered collections" 

The genome-registered collection of the invention may be used to determine 
alterations in gene expression under a variety of conditions. Such collections are amenable 
to rapid assay and may be used to confirm, correct or augment data generated from DNA 
micro array technology. 

In this disclosure, a number of terms and abbreviations are used. The following 
definitions are provided. 

"Open reading frame" is abbreviated ORF. The term "ORF" is refers to a gene that 
specifies a protein. 

"Polymerase chain reaction" is abbreviated PCR. 

As used herein, an "isolated nucleic acid fragment" is a polymer .of RNA or DNA 
that is single- or double-stranded, optionally containing synthetic, non-natural or altered 
nucleotide bases. An isolated nucleic acid fragment in the form of a polymer of DNA may 
be comprised of one or more segments of cDNA, genomic DNA or synthetic DNA. 

A "cellular array" means a set of strains differing from one another only in the 
chromosomal fragment fused to a reporter gene or reporter gene complex. 
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The term "array of arrays" refers to a collection of cellular arrays wherein each 
individual cellular array contains reporter gene fusions responsive to a specific inducing 
condition or set of conditions. 

The terms "global scale" or "genome-wide scale" when applied to reporter gene 
5 fusions refer to a minimum 15% representation of the transcription units of an organism. For 
example, there is predicted to be 2328 transcription units (t.u.s) in E. coli (RegulonDB 
Database v.3.0, http://www.cifn.unam.mx/Computational_Biology/Exoli-predictions/). 
Thus a set of unique fusions representing 329 t.u.s (15% of 2328) is at "global scale". The 
term "global scale" or "genome-wide scale" when applied to mutations or overexpression 
10 refers to a minimum 15% representation of the open reading frames of an organism. 

The term "genome register" refers to a procedure of precisely locating or mapping a 
defined nucleic acid sequence within a genome. 

The term "genome-registered collection" refers to a set of strains containing reporter 
gene fusions, loss of gene function mutations, altered gene function mutations, or 
15 overexpressed genes that have been "registered" or mapped by homology to the nucleic acid 
sequence of the genome of the organism. These genome-registered collections include 
reporter gene fusions to at least 15%, more preferably at least 20%, and most preferably at 
least 50% of all known or predicted promoter regions, loss of gene function mutations in at 
least 15%, more preferably at least 20%, and most preferably at least 50%, of all known or 
20 predicted ORFs, altered gene function mutations in at least 15%, more preferably at least 

20%, and most preferably at least 50%, of all known or predicted ORFs, or overexpression of 
in at least 15%, more preferably at least 20%, and most preferably at least 50%, of all known 
or predicted ORFs. 

As used herein the term "known" as applied to a gene or ORF within the context of a 
25 genome means that the sequence of the gene or ORF is known and should not be limited to 
knowledge of the function of that gene or ORF. 

The term "reporter gene fusion" refers to a chimeric gene consisting, in one part, of a 
gene or genes that are useful to detect transcription and/or translation initiated in the other 
part. 

30 The term "reporter gene complex" refers to any set of two or more genes that together 

are useful to generate a measurable signal. 

. The term "reporter gene activity" or "reporter construct activity", refers to the 
accumulation of the limiting component(s) of the reporter system in use which results in the 
measurable signal associated with that reporter system. The measured activity is the sum 
35 result of the de novo production, ongoing degradation or deactivation, and/or activation of 
the limiting component(s). 

The term "Lux-A Collection" refers to a specific set of E. coli strains containing 
plasmid-borne luxCDABE fusions. 

10 
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The term "lux fusion" or "hixCDABE fusion" refers to a chimeric gene consisting, of 
a genomic sequence joined to luxCDABE genes. 

The term "congenic strains" refer to two or more strains that differ from one another 
by a single mutation or a strain differing in only one gene to denote genes whose expression 
differ as a function of the allele. 

The term "DNA microarray" or "DNA chip" means assembling PCR products of a 
group of genes or all genes within a genome on a solid surface in a high density format or 
array. General methods for array construction and use are available (see Schena M, Shalon 
D, Davis RW 5 Brown PO., Quantitative monitoring of gene expression patterns with a 
complementary DNA microarray. Science. 1995 Oct 20; 270(5235): 467-70. and 
http://cmgm.stahford.edu/pbrown/mguide/index.htmn . A DNA microarray allows the 
analysis of gene expression patterns or profile of many genes to be performed 
simultaneously by hybridizing the DNA microarray comprising these genes or PCR products 
of these genes with cDNA probes prepared from the sample to be analyzed. DNA microarray 
or "chip" technology permits examination of gene expression on a genomic scale, allowing 
transcription levels of many genes to be measured simultaneously. Briefly, DNA microarray 
or chip technology comprises arraying microscopic amounts of DNA complementary to 
genes of interest or open reading frames on a solid surface at defined positions. This solid 
surface is generally a glass slide, or a membrane (such as nylon membrane). The DNA 
sequences may be arrayed by spotting or by photolithography (see 

http://www.affymetrix.com/). Two separate fluorescently-labeled probe mixes prepared 
from the two sample(s) to be compared are hybridized to the microarray and the presence 
and amount of the bound probes are detected by fluorescence following laser excitation 
using a scanning confocal microscope and quantitated using a laser scanner and appropriate 
array analysis software packages. Cy3 (green) and Cy5 (red) fluorescent labels are routinely 
used in the art, however, other similar fluorescent labels may also be employed. To obtain 
and quantitate a gene expression profile or pattern between the two compared samples, the 
ratio between the signals in the two channels (redrgreen) is calculated with the relative 
intensity of Cy5/Cy3 probes taken as a reliable measure of the relative abundance of specific 
mRNAs in each sample.. Materials for the construction of DNA microarrays are 
commercially available (Asymetrix (Santa Clara CA) Sigma Chemical Company (St. Louis, 
MO) Genosys (The Woodlands, TX) Clontech (Palo Alto CA) and Corning (Corning NY). 
In addition, custom DNA microarrays can be prepared by commercial vendors such as 
Affymetrix, Clontech, and Corning. 

The basis of gene expression profiling via micro-array technology relies on 
comparing an organism under a variety of conditions that result in alteration of the genes 
expressed. A single population of cells may be exposed to a variety of stresses that will result 
in the alteration of gene expression. Alternatively, the cellular environment may be kept 
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constant and the genotype may be altered. Typical stresses that result in an alteration in gene 
expression profile will include, but is not limited to conditions altering the growth of a cell 
or strain, exposure to mutagens , antibiotics, UV light, gamma-rays, x-rays, phage, 
macrophages, organic chemicals, inorganic chemicals, environmental pollutants, heavy 
5 metals, changes in temperature, changes in pH, conditions producing oxidative damage, 
DNA damage, anaerobiosis, depletion or addition of nutrients, addition of a growth 
inhibitor, and desiccation. Non-stressed cells are used for generation of "control" arrays and 
stressed cells are used to generate an "experimental", "stressed" or "induced" arrays. Induced 
arrays are those that demonstrate "altered gene expression". 
10 "Altered gene expression" refers to the change in the level of a transcription or 

translation products. If the gene is "up-regulated", the level of transcription or translation 
products is elevated. If the gene is "down-regulated" the level of transcription or translation 
products is decreased. Conditions under which gene expression is altered is also known as 
an "inducing condition". In some instance a number of different conditions may result in the 
15 same transcriptional effect on a single gene. Thus a number of inducing conditions will 
either up-regulate or down regulate the same gene. This collection of like inducing 
conditions is known as a "profile of inducing conditions". Similarly the term "perturbation" 
as used herein in reference to a cellular array or a DNA micro array is any alteration of 
environment or genotype the results in altered gene expression. 
20 The terms "high density" or "comprehensive" micro array refers to a high-density 

DNA micro-array containing at least 75% of the open reading frames of the organism. 

The term "altered gene expression" refers to the change in the level of a transcription 
or translation products. If the gene is "upregulated", the level of transcription or translation 
products is elevated. If the gene is "downregulated" the level of transcription or translation 
25 products is decreased. 

The term "expression profile" refers to the expression of groups of genes. 

The term "gene expression profile" refers to the expression of individual gene and 
suite of individual genes. 

The "comprehensive expression profile" refers to the gene expression profile of more 
30 than 75% of genes in the genome. In "comprehensive gene expression analysis" at least 
>75% of gene expression of the organism is analyzed. 

The term "corresponding to" is used herein to refer to similar or homologous 
sequences, whether the exact position is identical or different from the molecule to which the 
similarity or homology is measured. A nucleic acid or amino acid sequence alignment may 
35 include spaces. Thus, the term "corresponding to" refers to the sequence similarity, and not 
the numbering of the amino acid residues or nucleotide bases. 
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The terms "print" or "printing" refer to transferring one or more cultures from one 
locale, most often a multiwell culture plate, to a substrate or solid surface by physical contact 
transfer or any of a plurality of technologies. 

The term "regulon" refers to groups of operons sharing similar regulation. 

The term "operon" refers to a unit of bacterial gene expression and regulation, 
including structural genes and control elements in DNA recognized by regulator gene 
product(s) that in combination support the production of an mRNA. 

The term "probe" refers to a single-stranded nucleic acid molecule that can base pair 
with a complementary single stranded target nucleic acid to form a double-stranded 
molecule. 

The term "genotype" refers to the genetic constitution of an organism as 
distinguished from its physical appearance. 

The term "genomic DNA" refers to a single complete set of genetic information 
carried in the chromosomes of an organism. 

The term "total RNA" refers to non-fractionated RNA from an organism. 

The terms "protein specifying RNA" or "protein specifying transcript" refer to RNA 
derived from an ORF. 

The term "transposition" means a biochemical reaction that catalyzes the movement 
of a transposable element from one site into different site within a DNA molecule. 
Transposition can be carried out in vivo or in vitro. 

A nucleic acid molecule is "hybridizable" to another nucleic acid molecule, such as a 
cDNA, genomic DNA, or RNA, when a single stranded form of the nucleic acid molecule 
can anneal to the other nucleic acid molecule under the appropriate conditions of temperature 
and solution ionic strength. Hybridization and washing conditions are well known and 
exemplified in Sambrook, J., Fritsch, E. F. and Maniatis, T. Molecular Cloning: A 
Laboratory Manual, Second Edition, Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor (1989), particularly Chapter 1 1 and Table 12.1 therein (entirely incorporated herein 
by reference). The conditions of temperature and ionic strength determine the "stringency" 
of the hybridization. Hybridization requires that the two nucleic acids contain 
complementary sequences, although depending on the stringency of the hybridization, 
mismatches between bases are possible. The appropriate stringency for hybridizing nucleic 
acids depends on the length of the nucleic acids and the degree of complementation, 
variables well known in the art. The greater the degree of similarity or homology between 
two nucleotide sequences, the greater the value of Tm for hybrids of nucleic acids having 
those sequences. The relative stability (corresponding to higher Tm) of nucleic acid 
hybridizations decreases in the following order: RNA:RNA, DNA:RNA, DNA:DNA. For 
hybrids of. greater than 1 00 nucleotides in length, equations for calculating Tm have been 
derived (see Sambrook et aL, supra, 9.50-9.51). For hybridizations with shorter nucleic 
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acids, i.e., oligonucleotides, the position of mismatches becomes more important, and the 
length of the oligonucleotide determines its specificity (see Sambrook et al., supra, 
1 1.7-1 1.8). Furthermore, the skilled artisan will recognize that the temperature and wash 
solution salt concentration may be adjusted as necessary according to factors such as length 
5 of the probe. 

The term "complementary" is used to describe the relationship between nucleotide 
bases that are capable to hybridizing to one another. For example, with respect to DNA, 
adenosine is complementary to thymine and cytosine is complementary to guanine. 
Accordingly, the instant invention also includes isolated nucleic acid fragments that are 
10 complementary to the complete sequences as reported in the accompanying Sequence Listing 
as well as those substantially similar nucleic acid sequences. 

"Gene" refers to a nucleic acid fragment that expresses a specific protein, including 
regulatory sequences preceding (5' non-coding sequences) and following (3 1 non-coding 
sequences) the coding sequence, unless mentioned otherwise. For example, "reporter genes" 
15 used in gene fusion does not include regulatory (promoter) sequences unless specified 
otherwise. "Native gene" refers to a gene as found in nature with its own regulatory 
sequences. 

The terms "chimeric gene", "gene fusion", or "fusion" refer to any non-native gene or 
genes comprising two or more genomic or artificial DNA fragments that are not found in 

20 nature. Accordingly, a chimeric gene, gene fusion or fusion may comprise regulatory 
sequences and coding sequences the are derived from different sources, or regulatory 
sequences and coding sequences derived from the same source but arranged in a manner that 
different than that is found in nature. "Endogenous gene" refers to a native gene in its 
natural location in the genome of an organism. A "foreign" gene refers to a gene not 

25 normally found in the host organism, but that is introduced into the host organism by gene 
transfer. Foreign genes can comprise native genes inserted into a non-native organism, or 
chimeric genes. A "transgene" is a gene that has been introduced into the genome by a 
transformation procedure. 

The term "coding sequence" refers to a DNA sequence that codes for a specific 

30 amino acid sequence. "Suitable regulatory sequences" refer to nucleotide sequences located 
upstream (5' non-coding sequences), within, or downstream (3* non-coding sequences) of a 
coding sequence, and which influence the transcription, RNA processing or stability, or 
translation of the associated coding sequence. Regulatory sequences may include promoters, 
translation leader sequences, introns, and polyadenylation recognition sequences. 

35 The term "promoter" refers to a DNA sequence to which RNA polymerase can bind 

to initiate the transcription. In general, a coding sequence is located 3' to a promoter, 
sequence. Promoters may be derived in their entirety from a native gene, or be composed of 
different elements derived from different promoters found in nature, or even comprise 
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synthetic DNA segments. It is understood by those skilled in the art that different promoters 
may direct the expression of a gene in different tissues or cell types, or at different stages of 
development, or in response to different environmental conditions. Promoters which cause a 
gene to be expressed in most cell types at most times are commonly referred to as 
"constitutive promoters". It is further recognized that since in most cases the exact 
boundaries of regulatory sequences have not been completely defined, DNA fragments of 
different lengths may have identical promoter activity. "Promoter region" is promoter and 
adjacent areas whose function may be modulate promoter activity. 

The "3' non-coding sequences" refer to DNA sequences located downstream of a 
coding sequence and include polyadenylation recognition sequences and other sequences 
encoding regulatory signals capable of affecting mRNA processing or gene expression. The 
polyadenylation signal is usually characterized by affecting the addition of poly adenylic acid 
tracts to the 3' end of the mRNA precursor. 

"RNA transcript" refers to the product resulting from RNA polymerase-catalyzed 
transcription of a DNA sequence. When the RNA transcript is a perfect complementary 
copy of the DNA sequence, it is referred to as the primary transcript or it may be a RNA 
sequence derived from post-transcriptibnal processing of the primary transcript and is 
referred to as the mature RNA. "Messenger RNA (mRNA)" refers to the RNA that is 
without introns and that can be translated into protein by the cell. "cDNA" refers to a 
double-stranded DNA that is complementary to and derived from mRNA. "Sense" RNA 
refers to RNA transcript that includes the mRNA and so can be translated into protein by the 
cell. "Antisense RNA" refers to a RNA transcript that is complementary to all or part of a 
target primary transcript or mRNA and that blocks the expression of a target gene (U.S. 
Patent No. 5,107,065). The complementarity of an antisense RNA may be with any part of 
the specific gene transcript, i.e., at the 5' non-coding sequence, 3* non-coding sequence, 
introns, or the coding sequence. "Functional RNA" refers to antisense RNA, ribozyme 
RNA, or other RNA that is not translated yet has an effect on cellular processes. 

The term "operably linked" refers to the association of nucleic acid sequences on a 
single nucleic acid fragment so that the function of one is affected by the other. For 
example, a promoter is operably linked with a coding sequence when it is capable of 
affecting the expression of that coding sequence- (i.e., that the coding sequence is under the 
transcriptional control of the promoter). Coding sequences can be operably linked to 
regulatory sequences in sense or antisense orientation. 

The term "expression", as used herein, refers to the transcription and stable 
accumulation of sense (mRNA) or antisense RNA derived from the nucleic acid fragment of 
the invention. Expression may also refer to translation of mRNA into a polypeptide. 

The term "transformation" refers to the acquisition of new genes in a cell after the 
incorporation of nucleic acid. 
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The terms "plasmid", "vector", and "cassette" refer to an extra chromosomal element 
often carrying genes which are not part of the central metabolism of the cell, and usually in 
the form of circular double-stranded DNA molecules. Such elements may be autonomously 
replicating sequences, genome integrating sequences, phage or nucleotide sequences, linear 
5 or circular, of a single- or double-stranded DNA or KNA, derived from any source, in which 
a number of nucleotide sequences have been joined or recombined into a unique construction 
which is capable of introducing a promoter fragment and DNA sequence for a selected gene 
product along with appropriate 3' untranslated sequence into a cell. "Transformation 
cassette" refers to a specific vector containing a foreign gene and having elements in addition 
10 to the foreign gene that facilitate transformation of a particular host ceil. "Expression 

cassette" refers to a specific vector containing a foreign gene and having elements in addition 
to the foreign gene that allow for enhanced expression of that gene in a foreign host. 

The term "restriction endonuclease" or "restriction enzyme" refers to an enzyme 
which binds and cuts within a specific nucleotide sequence within double stranded DNA. 
15 The term "bioluminescence" refers to the phenomenon of light emission from any 

living organism. 

The term "relative light unit" is abbreviated "RLU" and refers to a measure of light 
emission as measured by a luminometer, calibrated against an internal standard unique to the 
luminometer being used. 
20 The term "stress" or "environmental stress" refers to the condition produced in a cell 

as the result of exposure to an environmental insult. 

The terms "insult" or "environmental insult" refers to any substance or environmental 
change that results in an alteration of normal cellular metabolism in a bacterial cell or 
population of cells. Environmental insults may include, but are not limited to, chemicals, 
25 environmental pollutants, heavy metals, changes in temperature, changes in pH as well as 
agents producing oxidative damage, DNA damage, anaerobiosis, changes in nitrate 
availability or pathogenesis. 

The term "stress response" refers to the cellular response resulting in the induction of 
either detectable levels of stress proteins or in a state more tolerant to exposure to another 
30 insult or an increased dose of the environmental insult. 

The term "stress gene" refers to any gene whose transcription is induced as a result of 
environmental stress or by the presence of an environmental insults. 

The terms "log phase", "log phase growth", "exponential phase", or "exponential 
phase growth" refer to cell cultures of organisms growing under conditions permitting the 
35 exponential multiplication of the cell number. 

Standard recombinant DNA and molecular cloning techniques used here are well 
known in the art and are described by Sambrook, J., Fritsch, E. F. and Maniatis, T., 
Molecular Cloning: A Laboratory Manual. Second Edition, Cold Spring Harbor Laboratory 
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Press, Cold Spring Harbor, NY (1989) ; arid by Silhavy, T. J., Bennan, M. L. and Enquist, L. 
W., Experiments with Gene Fusions. Cold Spring Harbor Laboratory Cold Press Spring 
Harbor, NY (1 984); and by Ausubel, F. M. et al., Current Protocols in Molecular Biology. 
published by Greene Publishing Assoc. and Wiley-Interscience (1987). 

In this invention, a random set of fragments of E. coli genome was generated by 
partially digesting E. coli genome with the restriction enzyme Sau3Al, and separating the 
fragments by size fractionation. The fraction with an average size of 1 .8 KB was isolated 
and ligated to the plasmid pDEW201 which contains the origin of replication and bla from 
pBR322, four transcription terminators upstream of the promoterless luxCDABE gene. The 
ligation products were transformed into K coli XL2Blue cells. Approximately 24000 
resulting transfdrmants were pooled and used AS the source of heterologous plasmid DNA. 
This plasmid DNA pool was used to transform K coli DPD 1 675 cells. A total of 8066 
individual transformants were isolated and labeled as Lux-A Collection. A homology search 
for the sequence from beginning and end of clones from Lux-A Collection was performed 
against complete E. coli sequence. The location and orientation of each chimeric gene fusion 
with respect to the E. coli genome was determined based on the computed homology. 
Additional sequences for the collection can be added by more random fusions or inverting 
the orientation of DNA in the fusion to put the promoter regions in the right orientation for 
the inserted DNA by using the method that is routine in the art. 

Because E. coli is well characterized and its genome has been sequenced, it is most 
preferred prokaryotic organism for this invention. Other preferred organisms are those 
whose genomes are being sequenced or that have completely sequenced genomes and are 
amenable to introduction of gene fusions by transduction, transformation, or conjugation. 
They include but are not limited to the 1 13 organisms listed on the WEB page, 
http://www.ncbi.nlm.nih.gov/PMGifs/Genomes/bact.html as of 2/15/00. In addition to the 
organisms mentioned above, any microbial organism with at least 15% , preferably 20%, and 
most preferably 50% of its genome sequence known that is amenable to introduction of gene 
fusions by transduction, transformation, or conjugation can be used in this invention to 
generate reporter gene fusions. 

As for the reporter gene or gene complex, luxCDABE is most preferred for its 
sensitivity and simplicity of assay conditions. Other reporter genes well known in the art 
also can be used in this invention. The preferred reporter genes include but are not limited to 
lacZ, gfp, cat, galK y inaZ, luc 7 luxAB, bgaB, nptll.phoA, uidA, and xy IE. 

Methods to introduce fusions into such strains are well known in the art. Fusions can 
be introduced by transduction, transformation or conjugation as appropriate for each specific 
organism. They can be constructed in vitro by techniques including but not limited to 
transposon (e.g., Tn7) mediated transposition, ligating of random fragments to a reporter 
gene construct containing a vector, or ligating of PGR products to the same vector. They can 
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be constructed by in vivo transposition where the transposon is introduced into the cell by 
transduction, transformation, or conjugation. 

The preferred methods to generate a set of fragments of the genome include but are 
not limited to, the partial digestion with a restriction enzyme, physical shearing, polymerase 
5. chain reaction (PCR) amplification, the combination of restriction enzyme digestion and 
PCR, the combination of restriction enzyme digestion and physical shearing, the 
combination of physical shearing and PCR, and the combination of all three. Above methods 
for generating fragments are well known in the art. For example PCR is well described in 
(U.S. Patent No. 4,683,202 (1987, Mullis, et al.) and U.S. Patent No. 4,683,195 (1986, 
10 Mullis, et al.), and restriction enzyme digestion and physical shearing is well described in 
Sambrook et al.; supra. 

Furthermore, chimeric reporter gene fusions can be also made by using transposition 
at the genome level. The above-described methods for generating transpositions are well 
known in the art. 

15 After the sequencing and genome-registering the majority of the Lux-A Collection 

members were completed the biological responses of the strain collection containing 
luxCDABE fusions to members of well-characterized regulatory circuits were examined. 
Thus, gene fusions to the lac operon, and members of the heat shock, SOS, SoxRS, and 
OxyR regulons were selected from the Lux-A Collection and the responses to known 

20 inducers of each of these global regulatory circuits were determined. 

Appropriate biological responses were demonstrated for each of regulons mentioned 
above. The Lux- A Collection contained 3 members that are fusions of hixCDABE to the lac 
operon (Figure 1). These cultures were grown in the presence of glucose and 
bioluminescence was measured and compared to the cultures grown in the absence of 

25 glucose. Cultures grown in the absence of glucose were much more highly bioluminescent 
than the ones in the presence of glucose. 

Four out of twelve heat shock promoters were found as lice fusions. A total of 
6 fusion containing strains were isolated and tested for the stress response in the presence of 
various concentration of ethanol. The heat shock regulon gene fusions from the Lux-A 

30 Collection confirmed induction of heat shock response by ethanol. . 

For SOS regulon, 7 lux fusion were found in the Lux-A Collection. When these 
stains were tested with DNA damaging chemical nalidixic acid, increased level of 
bioluminescence was observed. However, treatment with ethanol did not induce 
bioluminescence in the same culture. 

35 In a similar fashion, although SoxR/S and OxyR regulons were not fully represented 

in the Lux-A Collection* the biologically appropriate response was observed. 
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Thus, while the Lux-A Collection is not comprehensive in containing a fusion 
controlled by each promoter in E. coli, it nonetheless provides a genomic-wide overview of 
transcriptional responses to imposed stresses and can be adapted to optimize responses. 

The effect of mitomycin C (MMC), a known DNA damaging agent, was tested with 
5 DNA microarray and the resulting gene expression pattern was compared with the gene 

expression data from the treated cells of the Lux-A Collection. As expected, the expression 
of the known SOS genes was elevated in micro array data. However the expression of 
several SOS regulon genes was elevated less than 2-fold (Table 12) and as such were within 
a large group of 792 genes the expression of which was elevated by 20% or more. Because 

10 most of these are likely due to artifacts in the array data rather than to actual biologically 
relevant responses, the group of genes with less than two fold increase in expression were 
considered MMC non-inducible genes. Had these experiments been conducted using a 
compound with an otherwise unknown mode of action, some biologically relevant gene 
expression events would have been missed with the DNA micro array approach. To test if 

15 strains carrying luxCDABE gene fusions would yield the expected positive result, the three 
gene fusions that were available in the Lux-A Collection of reporter gene fusions were tested 
for mitomycin C responses. In all three cases, mitomycin C induced increased 
bioluminescence. Thus, this demonstrates that negative results from DNA rnicroarrays can 
be questioned by contradictory positive results with corresponding gene fusions. 

20 The genes not previously known to be upregulated with mitomycin C, provide an 

opportunity to further examine the correlation of DNA array and gene fusion experimental 
data. For this class of genes, four fusions were available in the Lux-A Collection of reporter 
gene fusions. The corresponding luxCDABE fusions to these four genes provided no 
evidence of increased gene expression induced by MMC. Thus, the positive results from the 

25 array were classified as false-positive. 

One gene fusion in the Lux-A Collection of gene fusions was found to have a 
genomic fragment that when inverted would result in a fusion to yebF, a gene observed to be 
upregulated by mitomycin C in DNA micro array experiments. When the genomic DNA 
fragment was released and inserted back into the vector, bioluminescence from some of the 

30 transformed colonies were found to be highly inducible by another DNA damaging agent, 
nalidixic acid, strongly suggesting that the inversion of DNA might have occurred. 
Furthermore, induction of bioluminescence by MMC was demonstrated as shown in 
Figure 2. 

The induction of increased gene expression from the yebF-lux fusion not only 
35 validates the data from the DNA array study, but furthermore, allows facile dose response 
and kinetic analyses and provides a biosensor strain for a high throughput screen. It is 
possible that the DNA damage response reported by yebF-lux fusion might be mediated by 
the well-characterized SOS response because there is a LexA box upstream of yebG (Lomba 
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et al., (1997) FEMS Microbiol Lett 156:1 19-122), which is just upstream of yebF. However, 
there is also suggested to be a promoter that drives transcription of the yebF gene (Blattner 
et al., (1997) Science 277:1453-1462), therefore the induction of yebF expression by MMC 
was unexpected. Thus, these results demonstrate the concept that a previously unknown 
5 gene expression event can be discovered with a DNA microarray, then a corresponding hix 
gene fusion can be used to validate and extend the results. Furthermore, such a lux fusion 
strain can form the basis of a high throughput screen based on gene expression changes. 

The yebF-luxCDABE gene fusion mentioned in the Example 3 can be used as a high 
throughput screen for compounds other than MMC and nalidixic acid that result in DNA 

10 damage. The use of the litxCDABE bioluminescent reporter fusion allows facile detection of 
reporter gene activity in a manner that does not require cell lysis or addition of enzymatic 
substrates. Thus development of a high throughput screen only requires an instrument to 
quantitate light production (of which there are many available commercially) and a source of 
bacterial cell cultures containing the gene fusion of choice. Such bacterial cell cultures can 

15 be supplied by one of several methods. A basic way to use bacterial strains containing gene 
fusions is simply with freshly grown bacterial cell cultures. An alternative to daily 
cultivation is the use of frozen culture aliquots, such as has been successfully demonstrated 
with an E. coli bioluminescent sensor that is competent for gene expression assays 
immediately after thawing, having been previously stored at -80 °C using a cryoprotectant 

20 such as glycerol (Sticher et al., (1997) Appl. Environ. Microbiol 63:4053-4060). Two other 
common methods of handling bacteria, lyophilization and continuous culture, have also 
proven to be useful sources of lux fusion strains for testing purposes. Lyophilization has 
been successfully applied to metal responsive and heat shock responsive cellular biosensor 
strains (Corbisier et al., (1996) Environ. Toxicol. Water Qual 1 1 :1 71-177; Tauriainen et al. 5 

25 (1998) Biosensors & Bioelectronics 13:931-938; Tauriainen et al., (1997) Appl. Environ. 
Microbiol. 63:4456-4461; Van Dyk and Wagner (1998) U.S. 5,731,163; Wagner and Van 
Dyk (1998), Methods in Molecular Biology: Bioluminescence Methods and Protocols, 
vol. 102. Humana Press Lie p.: 123-127). Continuous cultivation of E. coli bioluminescent 
biosensors in mini bioreactors has been shown to yield reproducible detection of stress 

30 responses (Gu et al., (1996), Biotechnol. Prog. 12:393-397; Gu et al., (1999) Biosens. 
Bioelectron. 14:355-361). 

Other possible methods to supply cellular biosensors for high throughput screens are 
immobilization by entrapment in a carrier material or use of a bioluminescent bioreporter 
integrated circuit (BBIC). Strontium alginate immobilization of a lux fusion containing 

35 bacterial strain has been demonstrated for use as a probe for waste streams (Heitzer et al., 
(1 994) Appl. Environ. Microbiol. 60: 1 487-1494; Matrubutham et al., (1 997) Appl. 
Microbiol Biotechnol 47:604-609; Webb et al, (1997) Biotechnol. Bioeng. 54:491-502). In 
another example of immobilization, calcium alginate beads, harboring an SOS responsive lux 
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fusion strain, stored in a CaCl 2 solution at"4°C were found to give useful DNA damage 
induction responses for up to one month after formation (Davidov et aL, in press). Likewise, 
calcium alginate immobilization of a copper-responsive biosensor results in superior stability 
of the biosensor relative to the immobilization of the same biosensor in agarose (de Lorenzo 
5 et aL, (1999) Anal. Chim. Acta 387:235-244). Furthermore, combining of immobilized cells 
and light detection equipment is possible for sensors that produce visible light as a signal. In 
one such case, an E. coli luc fusion strain is immobilized on the end of fiber optic monitoring 
device (Ikariyama et aL, (1997) Anal Chem. 69:2600-2605). Finally, in the BBIC approach, 
which takes advantage of the cellular signal generated by the five gene luxCDABE reporter, a 
10 bioluminescent biosensor strain is deposited onto a micro-luminometer fabricated within an 
integrated circuit; the light produced by the biosensor is detected by the integrated circuit, 
which then processes and communicates the results (Simpson et aL, (1998) Soc. Opt. Eng. 
3328 (Smart Electronics and MEMS):202-212; Simpson et aL, (1998) TIBTECH 
16:332-338). 

15 Therefore, development of a DNA damage responsive high throughput screen based 

on the newly discovered yebF-luxCDABE fusion is readily accomplished by choosing one of 
the above known methods to provide an active cellular biosensor and combining it, if 
necessary, with an instrument that measures visible light production. Other promoters from 
a stress response gene may be used to generate high throughput screens using a lux fusion. 

20 Stress response gene promoters from both prokaryotic and eukaryotic cells may be used, 
however promoters from bacteria are preferred and promoters from E. coli are most 
preferred. Suitable stress response gene promoters may be selected from but are not limited 
to the list of genes under the heading "responding genes" given in Table 1 below, and other 
newly discovered regulatory circuits! 
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TABLE 1 

REGULATORY REGULATORY RESPONDING 
STIMULUS GENE(S) CIRCUIT GENES* 



Protein 
Damage 3 


rpoH 


Heat Shock 


grpE, dnaK, 
Ion, rpoD, groESL, 
lysU, htpE, htpG, htpl, 
nipjs*, cipr-, cipJo, 
htpN, htpO, htpX, etc. 


uina. Damage u 


lexA, recA 


CAC 

uUo 


vbcA, uvrA, 
lexA, umuDC, uvrA, 
uvrB, uvrC, sulA, 
recN, avrD, ruv, dinA, 
dinB, dinD, dinF etc. 


Oxidative 
Damage 0 


oxyR 


Hydrogen 
Peroxide 


katG, ahp, etc. 


Oxidative 
Damage a 


soxRS 


Superoxide 


micF, sodA, 
nfo, zwf, soi, etc. 


Membrane 
Damage e 


fadR 


Fatty Acid 
Starvation 


fabA 


Any f 




Universal 
Stress 


uspA 


Stationary 
Phases 


rpoS 


Resting State 


xthA, katE, 
appA, mcc, bolA, 
osmB, treA, otsAB, 
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Genes whose expression is increased by the corresponding stimulus and whose expression is 
controlled by the corresponding regulatory gene(s). 
a Neidhardt and van Bogelen in E. coli and Salmonella typhimurium; Cellular and Molecular 
Biology (Neidhardt, F.C., et al. Eds., pp. 1334-1345, American Society of Microbiology, 
Washington, DC (1987)) 

b Walker in E. coli and Salmonella typhimurium; Cellular and Molecular Biology (Neidhardt, F.C., 
et al. Eds., pp. 1346-1357, American Society of Microbiology, Washington, DC (1987)) 

c Christman et al. Cell 41: 753-762 (1985); Storz et al Science 248: 189-194 (1990); Demple, 
Ann. Rev. Genet. 25: 3 1 5-337 (1 99 1) 

d Demple, Ann. Rev. Genet. 25: 31 337 (1991) 

e Magnuson et al. Microbiol. Rev 57: 522-542 (1993) 

f Nystrom and Neidhardt, J. Bacteriol, 175: 2949-2956 (1993); 
Nystrom and Neidhardt (MoL Microbiol. 6: 3187-3198 (1992) 

g Kolter et al. Ann. Rev. Microbiol. 47: 855-874 (1993) 

n Cashel and Rudd in E. coli and Salmonella typhimurium; Cellular and Molecular Biology 

(Neidhardt, F.C., et al. Eds., pp. 1410-1438, American Society of Microbiology, Washington, DC 
(1987)); Winkler in E. coli and Salmonella typhimurium; Cellular and Molecular Biology 
(Neidhardt, F.C., et al. Eds., pp. 395-41 1, American Society of Microbiology, Washington, DC 
(1987)) 

1 Neidhardt, Ingraham and Schaecter. Physiology of the Bacterial Cell: A Molecular Approach, 
Sinauer Associates, Sunderland, MA (1990), pp 351-388; Magasanik and Neidhardt in E. coli and 
Salmonella typhimurium; Cellular and Molecular Biology (Neidhardt, F.C., et al. Eds., pp. 
1318- 1 325, American Society of Microbiology, Washington, DC ( 1 987)) 

J Wanner in E. coli and Salmonella typhimurium; Cellular and Molecular Biology (Neidhardt, F.C., 
et al. Eds., E. coli and Salmonella typhimurium; Cellular and Molecular Biology (Neidhardt, F.C., 
et al. Eds., pp. 1326-1333, American Society of Microbiology, Washington, DC (1987)) 

k Rietzer and Magasanik in E. coli and Salmonella typhimurium; Cellular and Molecular Biology 
(Neidhardt, F.C., et al. Eds., pp. 1302-1320, American Society of Microbiology, Washington, DC 
(1987)); Neidhardt, Ingraham and Schaecter. Physiology of the Bacterial Cell: A Molecular 
Approach, Sinauer Associates, Sunderland, MA (1990), pp 351-388 

Fusions to lice reporter genes can also be used to support identification of the 
operonic structures, newly discovered regulatory circuits, and promoter sites. Since the 
random library of E. coli genomic DNA was fused to the promoterless lux gene, the strength 
of bioluminescence in the resulting fusions should depend on the promoter from the E. coli. 
For example, if the fusion contains the complete promoter, lux gene expression would be 
stronger than the fusion that contains a partial region of same promoter. If the promoter is 
truncated to the point that it is non-functional, the lux gene expression would be absent. 
Such result can support the postulated operonic structures and dependence of promoter 
function in host (Example 4). 

The construction and sequencing of a random library of E. coli genomic fragments in 
plasmid pDEW201 has generated a genome-registered set of promoter activity reporter 
constructs. A highly parallel, solid phase cellular assay has been developed utilizing a 
subset of these reporters by combining bioinformatic analysis and robotic manufacturing of 
arrays and high throughput culture production. This assay has the capacity, utilizing 
standard robotic tools, to monitor promoter activity of entire microbial genomes in duplicate, 
or alternatively entire genomes of different microbes on the same array. The output of the 
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assay is a grayscale image and is compatible with commercial products for image analysis 
and data analysis developed specifically of DNA microarray technologies. 

Briefly, the assay involves creating a nonredundant collection of clones containing 
reporter constructs. These cells are grown to stationary phase and robotically printed at high 
5 density onto a porous membrane (e.g., Biodyne B Nunc). Any contact or non contact 

printing robot (e.g., Biomek 2000, Beckman) may be used for printing. The membrane is in 
close contact with solid media. A key aspect of this invention is the ability to move the 
membrane from one surface to another surface containing different media. The ability to 
measure the luminescence as a function of regulatory region activity from the cells grown in 

10 the solid surface was surprising and unexpected. Previously, LaRossa and Van Dyk 

(U.S. 6,025,13 1) reported that the method to detect the activity of regulatory regions as 
reporter gene activity in suitable hosts was restricted to the cells grown in the liquid media. 
Growth in solid media allows one to grow the cells to a desired density prior to perturbation 
then follow the kinetics of the response. Experimental protocols often involve perturbations 

15 that prohibit long term exposure due to cell death or other irreversible effects. The ability to 
move the entire array to new growth conditions allows one a great variety of experimental 
schemes including, but not limited, to pulsed or pulse/chase exposures, reversibility, and 
short term kinetic studies. Effects of perturbants can be determined by comparison of 
luminescence generated by treated and control cultures. 

20 Several important characteristics of the assay system needed to be evaluated. In 

particular, growth density and conditions, sensitivity, reproducibility, and the ability to 
perturb the reporters and detect changes were major focus points. 

Using DNA damage in E. coli as a model system, the assay was first developed with 
a small set of well characterized clones to evaluate the robustness of the approach. The set 

25 of 10 clones were tested for the response to DNA damaging agent, nalidixic acid. The 
expected nalidixic induced upregulation of genes in the SOS regulon was detected as 
increased bioluminescence (Figure 4B). The level of bioluminescence from the strains 
containing fusions to genes that are not responsive to DNA damage were not affected by the 
nalidixic acid treatment (Figure 4A). 

30 The clone set of Luxarray 0.5 was used to develop a highly parallel solid phase assay 

by printing the clones at high density. This invention allows collection of an image of the 
signal generated from reporter constructs such that the signal intensity can be subsequently 
quantified. This requires not only that the collection parameters (focal plane, magnification, 
integration time, and algorithm) are constant but also that the downstream image analysis 

35 software has the ability to process the images generated. Chemical perturbation, one of the 
utility of this invention, requires physically relocating a membrane from one culture plate to 
another. This results in images with minimal X-Y positional registration. Several 
commercially available products can efficiently process these images. Array Vision™ 
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(Imaging Research, Toronto, Canada) and ImageQuant (Molecular Dynamics, Sunnyvale, 
CA) are two examples of appropriate software packages. It is preferred that a cooled CCD 
camera is used to capture the data. 

Computational filters were applied to the genome-registered clone collection to 
5 generate a nonredundant set of reporter constructs representing approximately 28% of the 
promoters in the genome. Even at this level of coverage, it is likely that at least one 
representative promoter of all the regulons in E. coli is present in the collection. This large 
set was used in DNA damage perturbation experiments to confirm the response of several 
promoters. As found with the low density experiments quantified with the luminometer, the 

10 expected responses for each clone were well demonstrated. The five documented DNA 
damage-responsive reporter constructs clearly show an upregulation of expression. In 
contrast, light production from the strain carrying the lac promoter fusion as well as several 
strains carrying other promoter fusions was decreased. 

A high density cellular array (Lux array 1.0) in this invention was also used to 

15 identify a number of putative previously unknown DNA damage-responsive operons. When 
treated with the DNA damaging chemical, nalidixic acid, in addition to the expected 
promoter activity (i.e., SOS regulon), several previously undescribed promoters 
demonstrated a similar behavior. This invention can be used to monitor transcriptional 
changes in high throughput manner. 

20 This invention also provides a method to identify the fusions that would be useful 

except that the orientation of the chromosomal DNA is inverted relative to the reporter 
genes, such that the promoter regions of interest are not operably linked to the reporter 
genes. Selecting such fusions and inverting the orientation of the insert DNA can 
significantly enhance the utility of a sequenced collection by adding many more operable 

25 linked fusion to the collections. A simple way to do this is to digest the plasmid DNA with a 
single restriction enzyme that cuts just outside the cloned region and religate the pieces. 
Although a mixture of plasmids results from this procedure, in many cases the correctly 
oriented plasmid can be found because cells containing it, but not other possible products, 
will produce light. This inverting method was also used to add gene fusions to the Lux- A 

30 Collection. Other methods besides light production can be used (notably PCR) to identify 
the orientation or size of the insert. 

Additionally, specific PCR primers can be designed for any region of interest on the 
chromosome for which sequence data is available. For E. coli the entire genome is available. 
Therefore by utilizing positional and directional coordinates for known and predicted 

35 promoters, specific primers were designed for the purpose of generating a PCR product 
containing each promoter. The actual promoter was assumed to lie in a 400 basepair 
genomic fragment ending at the translational start codon position of the first open reading 
frame in the operon. Primer pairs were selected from this region for each operon such that 
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the "right" primer was forced to include the start codon of the first open reading frame of the 
operon (Primer3 5 Rozen and, Skaletsky (1996,1997,1998). Code available at 
http://www-genome.wi.mit.edu/genome_software/other/primer3.hto Acceptable PCR 
primer pairs were identified for 80% of the operons using the default Primer3 parameters 
with an optimal melting temperature, Tm, of 68 °C. Primer pairs for the remaining operons 
can be identified by either relaxing one or more parameters, expanding the search area or by 
using a different algorithm.' The final version of the primers also includes addition of 
appropriate EcoRI and Sad endonuclease cleavage sites (left and right primers respectively) 
such that the PCR products can be directionally cloned into pDEW201. 

Still other methods to generate gene fusions for a genome-registered collection result 
in chromosomal rather than plasmid-borne gene fusions. Transposable genetic elements 
carrying reporter genes can be used to generate gene fusions. If this is done by random 
transposition into the chromosome of the host organism, the subsequent gene fusions can be 
genome-registered with respect to the chromosomal DNA sequence by determining the 
sequence of the junction between the transposable element and the hromosome (Nichols 
et al., (1998) J. BacterioL 180:6408-641 1). Alternatively, transpositions done in vitro using 
a defined (and thus genome-registered) segment of chromosomal DNA can be recombined 
by homologous recombination into the host chromosome. Furthermore, gene fusions formed 
in vitro by other methods, such as clones in plasmid DNA, can be recombined into the 
chromosome of the host by homologous recombination (Balbas et al., (1996) Gene 
172:65-69; Lloyd and Low (1996) In Escherichia coli and. Salmonella: Cellular and 
Molecular Biology. ASM Press, pp 2236-2255), site specific recombination (Nash H. (1996) 
In Escherichia coli and Salmonella: Cellular and Molecular Biology. ASM Press, 
pp 2363-2376) or both (Boyd et al., (2000) J. BacterioL 182:842-847). 

The preferred methods to generate gene fusions include are but not limited to the use 
of plasmid DNA from clones requiring reorientation as template for the PCR to generate 
inversion, use of specific PCR primers when sequence data is available, or use of 
transposition to generate gene fusions in chromosome (Kenyon and Walker (1980) Proc. 
Natl Acad. Set U.S.A. 77:2819-2823). The most preferred method to generate genomic 
fragments is using restriction enzyme or physical shearing. The genomic fragments are then 
ligated to promoterless reporter gene to generate gene fusion. The above methods to 
generate fusions are well known in the art. 

This invention also provides a method to use the genome registered collection of 
gene fusions in a liquid format to discover gene fusions useful as biosensors. For example, 
39 gene fusions were found to be upregulated when exposed to limonene for 135 minutes 
(Example 7). By use of these gene fusion in a strain to be used as the production organism 
for limonene, the presence of limonene, or the optimum conditions for the limonene 
production can be measured by the upregulation of bioluminescence. 
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Gene expression profiles yield information relevant to understanding gene function 
and modes of chemical action. Likewise, such information can be gained by analysis of 
genetic alterations resulting in loss of function, reduced levels, or over-expression of gene 
products. Thus, an "array of arrays" can be used to enhance both mode of action studies and 
5 functional genomics. Flow diagrams I and II depict two ways such arrays of arrays can be 
used. 

Flow diagram I represents the several tests can be performed on a given perturbation 
that changes the environment of the cell. 

10 Flow Diagram I 
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empirical pattern matching or (2) prediction of gene function 

Flow diagram II describes that as data from arrays of arrays is analyzed, altered 
responses of interest can be further analyzed by selected tools in the array of arrays. 
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These arrays of arrays can be built by generating large collections of genome- 
registered mutations in genes of an organism. Several methods are available including but 
not limited to classical radiation and chemical induced mutagenesis as well as more modern 

10 genomic-based techniques including random in vivo transposition followed by sequencing of 
junctions to determine the gene disrupted, targeted in vitro transposition into individual 
genes followed by homologous recombination in vivo to generate disruptants, and primer 
oligonucleotide generated deletion insertion alleles generated in vitro by PCR and 
subsequently recombined into the genome. Spontaneous mutants can also be selected by a 

15 variety of methods (LaRossa, R. A. (1996) In Escherichia coli and Salmonella: Cellular and 
Molecular Biology. ASM Press, p. 2527-2587.). Likewise, a large collection of genome- 
registered genetic alterations that result in over-expression can be generated. This can be 
accomplished in several ways including but not limited to genetic selection (LaRossa, R. A. 
(1 996) In Escherichia coli and Salmonella: Cellular and Molecular Biology. ASM Press, 

20 p. 2527-2587), cloning on multicopy plasmids, placement of the gene to be over-expressed 
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behind a strong promoter, and placement of the gene to be over-expressed behind a regulated 
promoter. 

Perturbations are any alteration of environment or genotype. Perturbations that 
change the environment include but are not limited to physical properties, such as radiation 
5 fluence, radiation spectrum, humidity, substratum, or temperature; nutritional properties, 
such as carbon source, energy source, nitrogen source, phosphorus source, sulfur source, or 
trace element sources; biological properties, such as presence of competitors, predators, 
commensals, pathogens such as phage and other viruses, the presence of toxins, or 
bacterocins; and chemical properties, such as presence of chelators, inhibitors, toxicants or 

10 abnormal levels of normal metabolites.. 

Several tests can be performed on a given perturbation that changes the environment 
of the cell (Flow Diagram I). Responses include patterns of gene expression (e.g., reporter 
gene expression, presence or absence of specific protein or intermediates) and phenotypic 
effects of genetic alterations; these responses can be analyzed concomitantly. Examples of 

15 pheno types that may be screened for in the present method include but are not limited to 
metabolic capacity (e.g., carbon source requirement, auxotroph requirement, amino acid 
requirement, nitrogen source requirement, and purine requirement); Resistance to inorganic 
chemicals (e.g., acid, arsenate, azide, heavy metals, and peroxide); Resistance to organic and 
biological chemicals (e.g., antibiotics, Acridine, actinomycin, amino purine, amino 

20 phenylalanine, colicin, ethanol, fluoroacetate, mitomycin C, and nalidixic acid); Resistance 
to biological agents (e.g., phages); Resistance to physical extremes (e.g., temperature, pH, 
osmotolerance and radiation). The phenotypes amenable to detection by the present 
invention are numerous and a full review may be found in, Robert LaRossa: Escherichia 
coli and Salmonella: Cellular and Molecular Biology (1996) ASM press p. 2527-2587). This 

25 will yield an enhanced empirical matching of one perturbation to another. For instance if 
one chemical yields a very similar pattern of effects in all tests to another chemical, then the 
likelihood of a similar mode of action of the two chemicals is high. Secondly, such 
concomitant analysis of several patterns will enhance understanding of gene function. For 
example, if a group of genes is regulated similarly by environmental perturbation and genetic 

30 perturbation (i.e., mutation) in this group of genes have similar phenotypic effects, then 
similar function can be hypothesized. 

Flow Diagram II depicts that as data from arrays of arrays is analyzed, altered 
responses of interest can be further analyzed by selected tools in the array of arrays. For 
example, one can evaluate any perturbation by asking which mutants are hypersensitive, or 

35 hyper-resistant to the environmental change. Look at the gene expression profile of wild 
type and the altered mutants in response to the environmental change and in its absence. 
Another approach to examine genetic changes is to compare the genome-registered 
collection of mutants to the wild type by examining how growth characteristics vary between 
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the mutants and wild type with changes in a wide range of environmental parameters. 
Differences of interest are then followed up with gene expression profiling. 

As arrays of arrays are utilized, the massive amount of data on phenotypes, which 
results from interactions between genotypes and the environment and found by changing 
5 either the genetic composition or the culture conditions, will allow interpretation of the 
interplay between mutants and gene expression profiles. Analysis of such interactions will 
be also useful for discovery of gene function and determining the modes of chemical action. 
Furthermore, these analyses may lead to identification of useful targets for pharmaceuticals, 
antimicrobials, or agrochemicals, development of environmental diagnostic tests, or 
10 development of high throughput screen based on modes or sites of chemical action. 

EXAMPLES 

The present invention is further defined in the following Examples. It should be 
understood that these Examples, while indicating preferred embodiments of the invention, 
are given by way of illustration only. From the above discussion and these Examples, one 
15 skilled in the art can ascertain the essential characteristics of this invention, and without 

departing from the spirit and scope thereof, can make various changes and modifications of 
the invention to adapt it to various usage and conditions. 

GENERAL METHODS 
Standard recombinant DNA and molecular cloning techniques used in the Examples 
20 are well known in the art and are described by Sambrook, J., Fritsch, E. F. and Maniatis, T. 
Molecular Cloning: A Laboratory Manual; Cold Spring Harbor Laboratory Press: Cold 
Spring Harbor, (1989) (Maniatis) and by T. J. Silhavy, M. L. Bennan, and L. W. Enquist, 
Experiments with Gene Fusions, Cold Spring Harbor Laboratory, Cold Spring Harbor, N. Y. 
(1984) and by Ausubel, F. M. et al., Current Protocols in Molecular Biology, pub. by Greene 
25 Publishing Assoc. and Wiley-Interscience (1987). 

The meaning of abbreviations is as follows: "KB" means kilobase(s) "hr" means 
hour(s), "min" means minute(s), "sec" means second(s), "d" means day(s), "ml" means 
milliliter(s), "^1" means microliter(s), "nl" means nanoliter(s), "|J.g" means microgram(s), 
"ng" means nanogram(s), "mM" means millimolar, "|iM" means micromolar. 
30 Media and Culture Conditions : 

Materials and methods suitable for the maintenance and growth of bacterial cultures 
were found in Experiments in Molecular Genetics (Jeffrey H. Miller), Cold spring Harbor 
Laboratory Press (1972), Manual of Methods for General Bacteriology (Phillip Gerhardt, 
R.G.E. Murray, Ralph N. Costilow, Eugene W. Nester, Willis A. Wood, Noel R. Krieg and 
35 G. Briggs Phillips, eds), pp. 210-213, American Society for Microbiology, Washington, DC. 
or Thomas D. Brock in Biotechnology: A Textbook of Industrial Microbiology . Second 
Edition (1989) Sinauer Associates, Inc., Sunderland MA. All reagents and materials used 
for the growth and maintenance of bacterial cells were obtained from Aldrich Chemicals 

30 




WO 01/79419 PCT/US01/11855 

(Milwaukee, WI), DIFCO Laboraoties (Detroit, MI), Gibco/BRL (Gaithersburg, MD), or 
Sigma Chemical Company (St. Louis, MO) unless otherwise specified. 

LB medium contains following per liter of medium: Bacto-tryptone (10 g), Bacto- 
yeast extract (5 g), and NaCl (10 g). 
5 Vogel-Bonner medium contains the following per liter: 0.2 g MgS04.7H 2 0, 2 g 

citric acid.lH 2 0, 10 g K 2 HP0 4 and 3.5 g NaHNH 4 P0 4 .4H 2 0. 

Minimal M9 medium contains following per liter of medium: Na 2 HP0 4 (6 g), 
KH 2 P0 4 (3 g), NaCl (0.5 g), and NH 4 C1 (1 g). 

Above media were autoclaved for sterilization then 10 mi of 0.01 M CaCl 2 and 1 ml 
10 of 1 M MgS0 4 .7H 2 0 were added to M9 mediums. Carbon source and other nutrient were 
added as mentioned in the examples. All additions were pre-sterilized before they were 
added to the media. 
Molecular Biology Techniques : 

Restriction enzyme digestions, ligations, transformations, and methods for agarose 
15 gel electrophoresis were performed as described in Sambrook, J., et al., Molecular Cloning: 
A Laboratory Manual, Second Edition, Cold Spring Harbor Laboratory Press (1989). 
Polymerase Chain Reactions (PCR) techniques were found in White, B., PCR Protocols: 
Current Methods and Applications, Volume 15(1993) Humana Press Inc. 

EXAMPLE 1 

20 Construction., sequencing and registering a random library of 

K coli genomic fragments fused to a luxCDABE reporter 
The random library of E. coli genomic fragments in plasmid pDEW201, which 
contains the origin of replication and bla from pBR322, four transcription terminators 
upstream of the promoterless P. luminescens luxCDABE genes, and a multiple cloning site 
25 that lies between the terminators and luxCDABE were constructed as previously described 
(Van Dyk et ah, (1998) J. BacterioL 180:785-792; Van Dyk and LaRossa (1998) Methods in 
Molecular Biology: Bioluminescence, Methods and Protocols, Humana Press Inc. 
vol. 102:85-95). 

Briefly, chromosomal DNA isolated from E. coli strain W3 110 (Ernsting et al., 
30 (1992) 7. BacterioL 174:1 109-1 118) was partially digested with the restriction enzyme 

Sau3A\ 7 size fractionated by agarose gel electrophoreses, and a fraction with an average size 
of approximately 1.8 KB isolated. This fraction was ligated to pDEW201 that had 
previously been digested with BamHl and treated with calf intestinal alkaline phosphatase. 
The ligation products were used to transform ultracompetent E. coli XL2Blue cells 
35 (Stratagene) to ampicillin resistance using the protocol provided by Stratagene. Preliminary 
characterization of individual XL2Blue transformants that were picked in random indicated 
that a large percentage (16 of 16) contained insert DNA with sizes ranging from 0.9 to 
3.0 KB. Approximately 24,000 of these transformants were pooled and used as a source of 
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heterogeneous plasmid DNA isolated using Qiagen tip20 columns (Qiagen Corp). This 
plasmid DNA pool was used to transform E. a?/z'DPD1675 (Nishimura et al. (1990). Nucleic 
Acids Res. 18:6169; Van Dyk and LaRossa (1998) Methods in Molecular Biology: 
Bioluminescence Methods and Protocols, Humana Press inc. vol. 102:85-95) selecting for 
5 ampicillin resistance and using a 30 minute phenotypic expression time to minimize the 
presence of siblings. A total of 8066 individual transformants were used to inoculate the 
96-wells of sterile Microtest III™ Tissue Culture Plates (Falcon®) each containing 190 p.1 of 
Vogel-Bonner medium (Davis et aL, (1980) Advanced bacterial genetics. Cold Spring 
Harbor Laboratory, Cold Spring Harbor, NY) with glucose as a carbon source and 
10 supplemented with thiamine, uracil, proline, and 25 |ig/ml of ampicillin. These plates were 
covered and incubated overnight without shaking at 37 °C. The overnight cultures in 
96-well plates were used for permanent cryogenic storage in duplicate at -80 °C (Menzel, R., 
(1989) Anal Biochem. 181 :40-50). These 8066 individual cultures are called the Lux-A 
Collection. 

15 For DNA sequence analysis, one of the duplicate sets of the Lux-A Collection was 

thawed and used as an inoculum for cultures grown to saturation in Terrific Broth (Gibco- 
BRL, Inc) containing 100 \ig/m\ ampicillin in 96 well deep-well plates. Plasmid DNA was 
extracted from the cultures using the Qiagen R.E.A.L.™ prep method with the following 
modification: after lysis of cells, the plates were placed in a boiling water bath for 5 minutes 

20 and then rapidly chilled in an ice-water bath before precipitation with Buffer 3. This 

modification prevented degradation of the plasmid DNA by the nucleases present in the non- 
endA host strain, DPD1675. DNA sequencing reactions were performed with approximately 
1 jug of plasmid DNA under standard ABI Prism™ DyeTerminator Reaction Ready 
conditions with the primers pDEW20Lforward (SEQ ID NO:l, 

25 5 '-GGATCGGAATTCCCGGGGAT-3 ') and pDEW201 .reverse (SEQ ID NO:2, 

5 , -CTGGCCGTTAATAATGAATG-3 l ) to obtain sequence information from each end of the 
insert. DNA sequences were determined on ABI377™-XL 96-lane upgraded Sequencers 
under 4x run conditions on 5% PAG (polyacrylamide gel) LongRanger™ (FMC, Inc.) gels 
and analyzed with ABI software. DNA sequences were transferred to a UNIX based utility 

30 for further analysis. A homology search for the sequence from the beginning and end of 

each- Lux-A clone (in both orientations) was performed against the complete E. coli sequence 
(Genbank accession U00096) using Pearson's FASTA program (fasta3 ? Version 3.1tl3). The 
essential Fasta options were -nQH -m 10-z0. 

The essential data about each highly significant alignment (FASTA score >1000, 

35 minimum overlap length >200, and minimum identity >70%) was stored in a relational 
database (Sybase System 1 1, Sybase Inc.). 

The location of Lux-A clone on the E. coli genome was then based on the above 
computed homologies for both the beginning and end of the insert, using the following rules: 
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i) Both distal and proximal ends of the insert must have an unambiguously 
high sequence homology with E. coli, 

ii) the relative location and orientation of the matches of the sequence 
determined from the beginning and end of the clone implied a reasonable length for the 

5 clone, which were known to fall in a fairly narrow distribution. 

In many cases the above procedure gave a single probable location, but in others 
there were multiple possible locations. The results were stored in the relational database. 

A table of open reading frame annotations for E. coli was downloaded using NCBFs 
Entrez facility, and this data was stored in the relational database. 
10 A web based, tabular interface to the data was created. This allows one to see the 

Lux-A clones in relation to the functional annotation. A Java based graphical interface was 
also created to make positional and directional relationships easy to visualize. Thus the 
Lux-A Collection was registered to the E. coli genome. 

EXAMPLE 2 

15 Validation of Lux-A Collection by verification of selected global regulatory responses 

With the sequencing of Lux-A Collections and registering of the majority of the 
LuxA members, it was possible to further examine the biological responses of strains 
containing luxCDABE fusions to other members of well-characterized regulatory circuits. 
Gene fusions to the lac operon, and members of the heat shock, SOS, SoxRS, and OxyR 

20 regulons were selected from the Lux-A Collection and the responses to known inducers of 
each of these global regulatory circuits were tested. 
Growth media and chemicals . 

A rich liquid medium, LB (Miller, J. H., (1972) Experiments in molecular genetics. 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY), was used. When specified, 

25 ampicillin (Amp) was added at 150 ng/ml. Ethanol (200 proof, Quantum Chemicals) was 
diluted directly into LB medium. A stock solution of 20 mg/ml nalidixic acid (Sigma 
Chemical Co.) in 1 M NaOH was further diluted into LB medium. Likewise, a stock 
solution of 1 00 mg/ml methyl viologen (Sigma Chemical Co.) in water was further diluted 
into LB medium. A 30% solution of hydrogen peroxide (EM Science) was diluted into LB 

30 medium to the desired concentrations. 

Re-isolation from microplates, growth of cultures, and testing stress responses . 

Selected strains were reisolated from the Lux-A Collection microplates stored at — 
80 °C by streaking for single colonies on LB-Amp plates. The purity of the cultures in the 
wells was tested by inoculating four single colonies into 100 \x\ of LB medium in a 96 well 

35 luminometer plate (Microlite, Dynex Technologies). Consistent bioluminescence of each set 
of four isolates provided evidence of clonal purity. 

To test the stress responses, one or two single colonies of each selected strain were 
used to inoculate 5.0 ml of LB medium containing 150 |ag/ml of ampicillin. These cultures 
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were grown overnight at 37 °C then diluted by adding 50 juil or 100 |il of the overnight 
culture into 10.0 ml of fresh LB medium (without ampicillin) and grown with agitation at 
37 °C until the culture was in exponential phase with readings on a Klett-Summerson 
colorimeter containing the red filter of between 10 and 40. These actively growing cultures 
5 were immediately used to initiate a stress response experiment by adding 50 ul of cultures to 
50 ul of LB medium containing various concentrations of chemical in the wells a 96 well 
liuninometer plate. This division of the actively growing culture at the time of chemical 
addition ensured identical populations were present when the stress was imposed. Light 
production was measured in a Dynex ML3000 luminometer at 37 °C. The dimensionless 

10 units of light production, relative light units (RLU), are obtained by comparison with the 
light reading from an internal light-emitting diode. The cycle mode of the ML3000 
luminometer, similar to previous descriptions (Van Dyk et al. 5 Appl Environ, Microbiol. 
60:1414-1420) was used. 
Gene fusions to the lacZYA operon . 

15 Transcription of the very well characterized lac operon is regulated by both specific 

and global regulatory circuits. Specific, negative regulation is mediated by the focJ-encoded 
repressor (Choy and Adhya, (1996) In Escherichia coli and Salmonella: Cellular and 
Molecular Biology. ASM Press pp 1287-1299). Global regulation in response to glucose 
availability is mediated by positive transcriptional activation of cAMP-CRP (Botsford and 

20 Harman (1 992) Cyclic AMP in prokaryotes. Microbiol Rev. 56 : 1 00- 1 22). The Lux-A 

Collection contains three members that are fusions of luxCDABE to the lac operon as shown 
in Figure 1. To test for the appropriate response of these gene fusions to glucose, each was 
reisolated and tested for bioluminescence when grown in the presence or absence of glucose 
in LB medium. The results are shown in Table 2. 



TABLE 2 

Fusions to the lac operon in the Lux-A Collection. 
RLU of 1 00 ul overnight cultures 



Lux Clone 


LBAmp 150 +0.4%glucose 


LBAmplSO 


Iux-a.pk034.a6 (a) 


0.002 


23.0 


Iux-a.pk034.a6 (b) 


0.001 


18.8 


Iux-a.pk050.b9 (a) 


0.001 


26.0 


Iux-a.pk050.b9 (b) 


0.001 


0.007 


Iux-a.pk065.d4 (a) 


0.001 


19.1 


Iux-a.pk065.d4 (a) 


0.001 


19.6 



Two isolated single colonies of each culture were tested by growing overnight in the 
specified medium then measuring the bioluminescence of 100 jil in an ML3000 
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luminometer. LB Amp medium is LB medium containing ampicillin at 150 mg/ml 
concentration. 

With one exception, each of the isolated colonies from the cultures in the Lux-A 
5 Collection was much more highly bioluminescent in the absence of glucose than in its 

presence. One colony from the Iux-a.pk050.b9 that was not highly bioluminescent might be 
due to contamination of the culture in that well. Other putative cross contamination events 
have not been observed. Each of the lacZ-luxCDABE fusion strains had at least one 
reisolated colony that gave the expected response to glucose. Thus, the appropriate 
10 biological response of these Lux-A Collection members was verified. 
Heat shock regulon gene fusions . 

The Lux- A Collection was examined to find fusions to genes in the <r 32 -controlled 
heat shock regulon (Gross, C. A., (1996) In Escherichia coli and Salmonella: Cellular and 
Molecular Biology, Second ed. ASM Press, pp. 1382-1399). Table 3 shows the genes or 
15 operons for which the Lux-A Collection was searched and the number of gene fusions found 
for each gene. 

TABLE 3 



Lux-A Collection Gene Fusions to Heat Shock Regulon members 



Heat shock gene or operon 


Number of Fusions in the Lux-A Collection 


grpE 


0 


Ion 


1 


clpPX 


0 


dnaKJ 


0 


groESL (mopAB) 


0 


rpoD 


3 


htpG 


0 


clpB 


1 


htpX 


0 


htgA (htpY) 


0 


hslT(ibpA) 


0 


rfaDFCL (htrM) 


1 



While there was not complete representation of the each member of the heat shock 
20 regulon, lux fusions to four of twelve heat shock promoter (33%) were found. Thus, the 

Lux-A Collection contains members that should report on activation of this stress response. 
In addition, strains with the previously constructed grpE-luxCDABE gene fusion in parental 
plasmid pDEW201 (Van Dyk et aL, (1998) J. BacterioL 180:785-792) had been placed in 
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selected wells of the Lux-A Collection plates as a control. One of these was also selected for 
testing. Table 4 summarizes the information on the six strains that were reisolated from 
wells of the frozen Lux-A Collection. 



5 TABLE 4 

Heat shock regulon gene fusions 



Lux clone 


Strain 
name 


insert 
size 


fusion to:. 


Genes in 
cloned 
DNA 


Basal 
RLU* 


3% Ethanol 
Response 
ratio# 


lux-a. pk057.hl 


DPD2241 


581 


grpE 


yjjB'grpE' 


61.1 


5.2 


lux-a. 
pk021.gll 


DPD2242 


2154 


Ion 


'clpP clpX 


6.71 


5.6 


lux-a. pk089.e7 


DPD2246 


1989 


rpoD 


'dnaG 
rpoD* 2 


9.49 


2.4 


lux-a. pk054.c3 


DPD2243 


2319 


clpB 


'sfhByfiH 
clpB '3 


38.5 


4.7 


lux-a. pk040.e4 


DPD2244 


2650 


rfaDFCL 


yibB'rfaD 
rfaF' 


33.1 


1.4 



*From zero time point of control (untreated) actively growing cultures in Vogel-Bonner 
minimal medium that had been obtained during the screen for sulfometuron methyl 
10 responses (Van Dyk et aL, J. Bacteriol. 180:785-792). 

#The ratio of the bioluminescence from the culture treated with 3% ethanol at 40 minutes 
after ethanol addition to LB divided by the bioluminescence of the untreated control in 
LB at the same time point. 
^clpP and clpX are cotranscribed; thus, there is not expected to be a promoter upstream of 
15 clpX. 

2 This fragment should contain only the rpoD specific promoter (Tylor et aL, (1984) Cell. 
38:371-381) 

3 There is no intergenic space between sfhB and yfiH 9 thus there is unlikely to be a promoter 
upstream of yfiH. 

20 The mark ('or') is used to designate that the E. coli gene in the chimeric reporter gene fusion 
is truncated. If the ' is at the start of the gene name (e.g. lacZ), it means the 5' end is 
missing. If the ' is at the end of the gene (e.g. lacZ') it means the 3' end of the gene is not 
present. 

25 The light production from each of these strains in the absence of stress is consistent 

with each of these chromosomal DNA fragments containing an active promoter because 
these values are much greater than that from a culture carrying the parental plasmid 
pDEW201, which had 0.002 RLU under the same conditions (Van Dyk et aL, (1998) 
J, Bacteriol. 1 80:785-792). These strains were tested for their ability to report on the heat 

30 shock stress response by using 5, 4, 3, and 2 (v/v)% ethanol, a known, potent chemical ■ 
inducer. For each of the six strains, at least two concentrations of ethanol resulted in 
increased bioluminescence as compared with the control, untreated culture. Table 4 contains 
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the response ratio for each strain to 3% ethariol at 40 minutes of treatment. These data 
demonstrate that these heat shock regulon gene fusions from the Lux-A Collection report the 
induction of the heat shock response. 
SOS regulon gene fusions . 

In a like fashion as described above, the Lux-A Collection was examined to find 
fusions to genes in the SOS DNA damage response regulon (Kim et ah, (1997) Proc Natl 
AcadSci USA 94(25): 13792-7; Walker, G. C.<1996) In Escherichia coli and Salmonella: 
Cellular and Molecular Biology. ASM Press pp. 1400-1416). Table 5 shows the genes or 
operons for which the randomly generated Lux-A Collection was examined and the number 
of gene fusions present for each gene. 

TABLE 5 



Quantities of Lux-A Collection Gene Fusions to SOS Regulon members 



SOS gene or operon 


Number of Fusions in the Lux-A Collection 


urnuDC 


0 


recA 


2 


uvrA 


1 


uvrB 


0 


sulA (sfiA) 


0 


KvrD 


1 


recN 


0 


polB (din A) 


0 


dinP (dinB) 


1 


dinD 


0 


dinF 


1 


dinG 


1 


dinl 


0 


ruvAB 


1 



Of the fourteen SOS regulon genes or operons examined, seven (50%) were found to 
be represented by useful fusions in the Lux-A Collection. Thus, this collection contains 
members that should report on activation of the SOS stress response. Table 6 summarizes 
the information on the eight strains that were reisolated from wells of the frozen Lux-A 
Collection. 
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TABLE 6 
SOS regulon gene fusions 



Lux clone 


Strain 
name 


insert 
size 


fusion to: 


Genes in Insert 
DNA 


Basal 
RLU* 


Nalidixic 

acid 
response 
ratio# 


lux-a. 
pk085.a5 


DrDZZQ / 




ctinr 


plSt>(-) ugKA ( ~rj 

lexA(+) dinF(+)' 




Z.Z 


lux-a. 
pk0024.f5 


DPD2248 


2022 


dinG 


l rhlE(+) ybiA(~) 

J- S~i /'tit 


32.8 


2.0 


lux-a. 

pKUDD.a3 


DPD2249 


2125 


dinP 


fliiA(-)'mbhA(+) 


4.41 


5.0 


lux-a. 
pk022.d4 


DPD2250 


1789 


recA 


'mtlB(-) ygaD(-) 
recA(-)' 


97.5 


1.8 


lux-a. 
pk085.b5 


DPD2251 


1782 


recA 


■mtlB(-) ygaD(-) 
recA(-)' 


99.5 


2.8 


lux-a. 
pkOLb6 


DPD2253 


2105 


uvrA 


yjcC(+)'yjcB(-) 
ssb(+) nvrA(-)' 


not 
done 


1.4 


lux-a. 
pk052.b6 


DPD2254 


1632 


KvrD 


xerC(+) 
yigB(+) 
uvrD(+)' 


15.7 


1.2 


lux-a. 
pk058.f2 


DPD2293 


1890 


ruvA(-) 


'yebC(-) ruvC(-) 
yebB(+) ruvA(-)' 


14.3 


4.6 



*From zero time point of control (untreated) actively growing cultures in Vogel-Bonner 
minimal medium that had been obtained during the screen for sulfometuron methyl 
responses (Van Dyk et ah, (1 998) J. Bacteriol. 1 80-785-792). 



#The ratio of the bioluminescence from the culture treated with 5 ]-ig/ml of nalidixic acid at 
110 minutes after nalidixic acid addition to LB divided by the bioluminescence of the 
untreated control in LB at the same time point. 

The mark ('or') is used to designate that the E. coli gene in the chimeric reporter gene fusion 
is truncated. If the 1 is at the start of the gene name (e.g. 'lacZ), it means the 5* end is 
missing. If the ' is at the end of the gene (e.g. lacZ 1 ) it means the 3 1 end of the gene is not 
present. 

Each of these strains that were tested for basal bioluminescence in defined medium 
had a greater level of light production than did a strain containing the parental plasmid, 
indicating the presence of a promoter driving expression of the liccCDABE reporter. The 
ability of these jyvomoter-litxCDABE fusions to report activation of the SOS stress response 
was tested using nalidixic acid, a known, potent chemical inducer. The final concentrations 
of nalidixic acid were 1, 5, 25, and 125 p.g/ml for all but strain DPD2293, which was tested 
at 80, 40, 20, 10, 5, 2.5, and 1.25 ^ig/ml. For each of the eight strains, at least three 
concentrations of nalidixic resulted in increased bioluminescence. Table 6 gives the 
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response ratio to 5 H-g/ml of nalidixic acid at 110 minutes after addition. These data 
demonstrate that induction of the SOS response is reported by increased bioluminescence for 
these SOS regulon gene fusion in the Lux-A Collection. 

Specificity of responses was tested by measuring the effect of ethanol on these SOS 
regulon gene fusions. In each case there was little to no increased bioluminescence induced 
by 5, 4, or 3% (v/v) ethanol treatment. For example, the treatment with 3% ethanol at 
40 minutes after addition, a condition which resulted in increased light production from all 
the heat shock regulon gene fusions (Table 4), for strain DPD2249 containing a dinP- 
luxCDABE gene fusion yielded a response ratio was 0.80. Similarly, when strain DPD2243 
containing a heat shock regulon clpB-luxCDABE fusion was tested with nalidixic acid, no 
increase in bioluminescence was observed; at 1 10 minutes after addition of 5 jig/ml nalidixic 
acid, the response ratio was 0.40. Thus, the specificity of the heat shock response to ethanol 
and the SOS response to nalidixic acid was shown. 
SoxS-regulated oxidative damage regulon gene fusions . 

The Lux-A Collection was also examined for the presence of fusions to genes in the 
SoxR and SoxS regulated oxidative stress response regulon (Koh et ah, (1999) Mol. Gen. 
Genet. 261:374-380; Rosner and Storz (1997) Curr. Top. Cell Regul 35:163-177; Van Dyk 
et al. 9 (1998) J. Bacteriol. 180:785-792). Table 7 below shows results. 

TABLE 7 



Quantities of Lux-A Collection Gene Fusions to the SoxR/S regulon members 



SoxR/A regulon gene or operon . 


Number of Fusions in the Lux-A Collection 


sodA 


0 


nfo 


0 


fumC 


0 


achA 


0 


fpr 


0 


zwf 


3 


micF 


0 


acrAB 


0 


inaA 


1 


pqiAB 


0 


ribA 


1 


poxB 


1 



In a similar fashion to the heat shock and SOS regulons, the SoxR/S regulon was not 
fully represented in the Lux-A Collection. Nevertheless, the collection contained gene 
fusions to 33% of the SoxR/S regulon gene or operons that would be expected to report on 
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the activation of this stress response. Table 8 summarizes the information on the six strains 
that were reisolated from wells of the frozen Lux-A Collection. 

TABLE 8 
SoxR7S regulon gene fusions 



Lux clone 


Strain 
name 


insert 
size 


fusion 
to: 


Genes in Insert 
DNA 


Basal 
RLU* 


viologen 
response 
ratio 


lux-a. 
pk053.bl 


DPD2278 


1708 


zwf 


pykA(+) ■ 

yebK(+)zwf(-)' 


5.0 


Not tested 


lux-a. 
pk082.g7 


DPD2272 


1708 


zwf 


pykA(+)' 
vebKMzwf(-)' 


16.9 


3.6 


lux-a. 
pk088.el 


DPD2279 


1709 


zwf 


pykA(+)- 
yebK(+) zwf(-) 


4.2 


Not tested 


lux-a. 
pk078.d2 


DPD2286 


2308 


ribA 


yciM(+)' 
ol02(+) 

PZPBC+) ribA(-)' 


5.8 


4.7 


lux-a. 
pk014.a9 


DPD2087 


1583 


inaA 


•glpQyhaH 
inaA ' 


9.8 


9.0 


lux-a. 
pk071.all 


DPD3509 


1131 


poxB 


•b0872poxB' 


1.5 


15.0 



* From zero time point of control (untreated) actively growing cultures in Vogel-Bonner minimal medium that 
5 had been obtained during the screen for sulfometuron methyl responses (Van Dyk et al., (1998) 

J. BacterioL 180:785-792). 

# The ratio of the bioluminescence from the culture treated with 250ug/ml of methyl viologen at 120 minutes 

after methyl viologen addition to LB divided by the bioluminescence of the untreated control in LB at the 
same time point. 

10 The mark ('or') is used to designate that the E. coli gene in the chimeric reporter gene fusion is truncated. If 
the ' is at the start of the gene name (e.g. 'lacZ), it means the 5' end is missing. If the 1 is at the end of the 
gene (e.g. lacZ 1 ) it means the 3* end of the gene is not present. 

Like the SOS and heat shock regulon fusions, these had basal light production greater 
15 than the promoterless parental plasmid indicating the presence of a promoter. 

Responsiveness of four of these strains to a known inducer of the SoxR/S regulon, methyl 
viologen, was tested at 1000, 500, 250, 125, 62, 31 and 16 ug/ml. Each of these seven 
methyl viologen concentrations induced increased bioluminescence from each of the four 
strains. Table 8 gives the response ratio for 250 jig/ml at 120 minutes of treatment for the 
20 four tested strains. Here again, the biologically appropriate response was observed, 
OxvR-regulated oxidative damage regulon gene fusions . 

Fusions to OxyR regulated genes (Rosner and Storz. (1997) Curr. Top. Cell ReguL 
35:1 63-177) were found in the Lux-A Collection as summarized in Table 9. 
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TABLE 9 

Quantities of Lux-A Collection Gene Fusions to the OxyR regulon members 



OxyR regulon gene or operon 


Number of Fusions in the Lux-A Collection 


katG 


0 


gorA 


0 


dps 


0 


ahpCF 


3 



Of these four genes or operons controlled by OxyR, fusions to one (25%) was 
available in the Lux-A Collection. Table 1 0 summarizes the information on the three strains, 
each containing a fusion of the ahpCF operon regulatory region to the luxCDABE reporter, 
5 that were reisolated from wells of the frozen Lux-A Collection. 

TABLE 10 
OxyR regulon gene fusions 



15 



20 



Lux clone 


Strain name 


insert 
size 


fusion 
to: 


Genes in 
Insert 
DNA 


RLU 
initial 
screen* 


H 2 0 2 
response 
ratio# 


H 2 0 2 
response 
ratio in 
pcnB- 
hostt 


lux-a. 
pk051.d5 


DPD2283 


1184 


ahpC 


dsbG' 
ahpC(+)' 


92.2 


1:3 


16.5 


lux-a. 
pk051.e3 


DPD2284 


1184 


ahpC 


dsbG 1 
ahpC(+y 


68.3 


1.6 


15.3 


lux-a. 
pk03.d6 


DPD2285 


1182 


ahpC 


dsbG' 
ahpC(+)' 


47.2 


1.2 


not done 



*From zero time point of control (untreated) actively growing cultures in Vogel-Bonner 

minimal medium that had been obtained during the screen for sulfometuron methyl responses (Van Dyk 
et ah (1 998> J. BacterioL 1 80: 785-792). 
10 # The ratio of the bioluminescence from the culture treated with 0.002% hydrogen peroxide at 

30 minutes after hydrogen peroxide addition to LB divided by the bioluminescence of the untreated control 
in LB at the same time point. 
{The ratio of the bioluminescence from the culture of E. coli host strain DPD2228 

[F- lac4169 rpsL pcnB80 zad-2084::Tnl 0] containing the plasmid DNA from the original Lux-A 
Collection strain listed treated with 0.002% hydrogen peroxide at 30 minutes in LB after hydrogen peroxide 
addition divided by the bioluminescence of the untreated control in LB at the same time point. 
The mark ( e or s ) is used to designate that the E. coli gene in the chimeric reporter gene fusion 

is truncated. If the 1 is at the start of the gene name (e.g. 'lacZ), it means the 5' end is missing. If the ' is at 
the end of the gene (e.g. lacZ') it means the 3* end of the gene is not present. 

The high level of light production from these three strains containing ahpC- 
luxCDABE gene fusions indicates that they each contained a very active promoter. To test if 
these strains would report on activation of the OxyR-controlled oxidative stress response, 
each was treated with hydrogen peroxide at 0.016, 0.008, 0.004, 0.002, 0.001, 0.0005, 
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0.00025%. At best, the bio luminescence was minimally induced. Table 10 shows the 
response ratio to treatment with 0.002% hydrogen peroxide at 30 minutes after treatment. 

Previously, another luxCDABE fusion in plasmid pDEW201 to a gene in the OxyR 
regulon, katG had been shown to yield larger response ratios to hydrogen peroxide when the 
5 plasmid was moved to an E. coli host strain containing a pcnB mutation (Van Dyk et al., 
(2000) in press. In A. Mulchandani and O. A. Sadik (ed.% Recent Advances in 
Environmental Chemical Sensors and Biosensors. ACS Symposium Series). Mutations in 
this gene result in reduced plasmid copy number for plasmids with origins of replication like 
that in pBR322 (Lopilato et al., (1986) Mol. Gen. Genet. 205:285-290), thus resulting in 

10 reduced basal expression of gene fusions carried on such plasmids. Accordingly, to test if 
reducing the copy number of the ahpC-liaCDABE fusions would also yield more reliable 
detection of the OxyR-mediated stress response, plasmid DNA taken from two of these 
Lux-A Collection strains was moved by transformation into a pcnB' mutant host strain. The 
two resulting strains were tested for responses to hydrogen peroxide at 0.004, 0.002, 0.001, 

15 0.0005, 0.00025, 0.00012, 0.00006%. Each of these seven concentrations dramatically 

induced increased bioluminescence from both ahpCAuxCDABE fusion strains in the pcnB' 
host. The last column of Table 10 gives the response ratio to treatment with 0.002% 
hydrogen peroxide at 30 minutes. Thus, the appropriate biological response from these 
fusions to a highly expressed gene was obtained when the copy number of the gene fusion 

20 was reduced. 

In one instance where the response from the plasmid-borne gene fusion to a highly 
expressed gene was weak, it was demonstrated that reduction of plasmid copy number with a 
pcnB mutation resulted in more potent induction. Other fusions to highly expressed genes 
can also be moved to a host strain with reduced copy number, such a pcnB mutant, or 
25 integrated into the chromosome at a gene dosage of one (Elsemore, D. A. (1998) Methods in 
Molecular Biology: Bioluminescencent Protocols., vol. 102, p:97-104, Humana Press, Inc). 

EXAMPLE 3 

Use of a genome-registered collection of reporter gene fusions to confirm 
or question results from DNA array analysis and to develop high 
30 throughput screens based on gene expression 

DNA microarray experiment with Mitomycin C (MMC) 

The E. coli strain MG1655 (rph-T) was used. Cultures grown in LB at 37 °C 
overnight were diluted 1 to 250 into fresh LB and grown at 37 °C with aeration. Each 
subculture was split into two 1 00 ml cultures when the reading on a Klett-Sammerson 
35 colorimeter with the red filter reads 20 Klett units. MMC (Sigma, dissolved in ddH 2 0) at a 
final concentration of 250 ng/ml, a sub-lethal dose, was added to one of the split cultures. 
The other culture was a no addition control. Incubation at 37 °C continued for another 
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40 minutes, then cells were collected for preparing total RNA. The MMC treated culture 
and its control reached 60 and 55 Klett units, respectively after 40 minute treatment. 

Total RNA purification, first-strand cDNA labeling, preparation of the E. coll whole 
genome high-density microarray chips, hybridization and data analysis were done as 
5 previously described (Wei et al. (2001) J. Bacteriol 1 83: 545-556). Both cy3 and cy5 were 
used in probe labeling, and the hybridization experiments were repeated by swapping the 
fluorescence cy dyes between each pair of MMC treated sample and its blank control. 

Ratios of expression in the mitomycin C treated samples vs. controls were calculated 
for all genes in the DNA array. Ratios greater than or equal to 2 were considered induced 

10 genes, while those with ratios less than 2 fold were considered uninduced. The known SOS 
genes (Kim et al., (1997) Proc Natl Acad Sci USA 1997 Dec 9;94(25): 13792-7; Lomba 
et al, (1997) FEMS Microbiol Lett 156:1 19-122; Walker, G. C. (1996). In Escherichia coli 
and Salmonella: Cellular and Molecular Biology. ASM Press pp. 1400-1416) fell into both 
the induced (Table 1 1) and uninduced classes (Table 12). In addition, 20 genes not 

15 previously known to be induced by MMC were observed to be induced in the array 
experiment (Table 13). 

TABLE 11 



Known SOS genes induced by MMC in array experiment 



Gene 


Fold induction/ 
array experiment 


Available Lux 
Fusion 


Induction of Fusion 


recN 


8.3 


NO 




recA 


3.0 


YES 


YES 


lexA 


3.6 


NO 




dinl 


6.7 


NO 




dinD 


2.2 


NO 




uvrA 


2.3 


YES 


YES 


uvrB 


2.1 


NO 




ruvA 


2.0 


YES 


YES 


sulA 


5.8 


NO 




umuC 


2.1 


NO 




dinB (dinP) 


2.0 


YES 


YES 


bl848 (yebG) 


5.8 


NO 
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TABLE 12 



Known SOS genes NOT induced by MMC in array experiment 



Gene 


Fold induction/ 
array experiment 


Available Lux 
Fusion 


Induction of Fusion 


uvrD 


1.4 


YES 


YES 


polB (dinA) 


1.2 


NO 




dinG 


1.4 


YES 


YES 


dinF 


1.8 


YES 


YES 


himA 


1.3 


NO 




ruvB 


1.5 


NO 




umuD 


1.2 


NO 





TABLE 13 

Genes not previously known to be DNA damage-inducible 



found induced by 


MMC in array experiment 


Gene 


Fold induction/ 
array expt 


Available Lux 
Fusion 


Induction of Fusion 


mioC 


2.1 


NO 




xseA 


2.0 


NO 




insB 2 


2.2 


NO 




insB 1 


2.1 


NO 




insA 4 


2.1 


NO 




secG 


2.2 


NO 




exbD 


2.2 


YES 


NO 


trkH 


2.1 


YES 


NO 


in/A 


2.3 


- NO 


r 


hslS 


6.7 


NO 




hslT 


4.0 


NO 




cspA 


2.9 


NO 




dniR 


2.1 


YES 


NO 


b0531 


3.3 


NO 




bl 847 (yebF) 


3.3 


YES 

(when inverted) 


YES 


M228 


2.5 


NO 




b2940 


2.3 


NO 




b0571 (ylcA) 


2.2 


YES 


NO 


b2559 


2.0 


NO 




b3199 


2.0 


NO 
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Verification of expected responses of MMC induction of known SOS genes 

The Lux-A Collection of reporter gene fusions was examined for presence of fusions 
to the genes in Table 1 1 . Four were found to be present in the collection. Each of these was 
tested for induction by MMC at several doses over a time course of 100 minutes. The 
expected induction response of a lag period with no change in gene expression followed by 
an induction of increased bioluminescence was observed in all four cases at several doses of 
MMC. Thus, this demonstrates that results from DNA array experiments can be verified by 
using corresponding gene fusions. 

Questioning the negative result of non-induction of known SOS genes 

As expected, the expression of the known SOS genes was elevated; however the 
expression of several was elevated less than 2-fold (Table 12) and as such were within a 
large group of 792 genes the expression of which was elevated by 20% or more. Most of 
these are likely due to artifacts in the array data rather than to actual biologically relevant 
responses. To test if strains carrying IuxCDABE gene fusions would yield the expected 
positive result, the three gene fusions that were available in the Lux-A Collection of reporter 
gene fusions were tested for mitomycin C responses. In all three cases, mitomycin C 
induced increased bioluminescence. Thus, this demonstrates that negative results from DNA 
arrays can be questioned by contradictory positive results with corresponding gene fusions. 
Questioning or confirmation of induction of previously unknown MMC inducible genes 

The genes that were not previously known to be induced with mitomycin C were . 
further examined for correlation of DNA array and gene fusion experimental data. For this 
class of genes, four fusions were available in the Lux-A Collection of reporter gene fusions. 

The corresponding IuxCDABE fusions to these four genes provided no evidence of 
increased gene expression induced by MMC. 

An additional gene fusion in the Lux-A Collection of gene fusions was found to have 
a genomic fragment that when inverted would result in a fusion to yebF. In this case, a 
divergent promoter to the purT gene was present in the chromosomal fragment and strains 
containing the backward yebF fusion produced light. Following isolation of plasmid DNA, 
Xmal digestion that releases the insert DNA from the vector, religation and transformation, 
10% of the transformants produced light. Of these, 20% (or 2% of the initial transformants) 
were found to be highly induced by nalidixic acid, another DNA damaging agent. This 
result strongly suggested that inversion of the insert DNA had occurred in these nalidixic- 
acid inducible gene fusions. The orientation of the inserted segment to yield a fusion of 
yebF to the IuxCDABE operon was confirmed by DNA sequence analysis. Furthermore, 
induction by mitomycin C was demonstrated, as shown in Figure 2. 
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EXAMPLE 4 

Functional Definition of Postulated Promoter Regions, 
The genes encoding production of type I extracellular polysaccharide in E. coli are 
located in a cluster of 20 genes (Stevenson et al. (1996) 1 Bacterid 178: 4885-4893). An 
5 upstream promoter for these genes has been identified (Stout, V. (1996) J. BacterioL 178: 
4273-4280) and its regulation has been characterized (Gottesman, S. (1995) Two-component 
Signal Transduction. American Society of Microbiology pp253-262; Wehland and Bernhard 
(2000) J. Biol Chem. 275:7013-7020). Nonetheless, the transcriptional organization of this 
region has not been completely defined. The annotated sequence for these genes (Blattner et al. 

10 (1997) Science 277:1453-1462), which are transcribed from one strand of the genome, suggests 
the existence of several putative promoters and activator binding sites. Furthermore, a 
prediction of operon structure in this region suggests that these genes may be organized into 
several transcriptional units (Thieffry et al. (1998) Bioinformatics 14:391-400). Figure 8 
summarizes the predicted promoters in this region. Unexpectedly, an E. coli DNA microarray- 

15 based experiment with strains 397C, containing a truncated ' subunit of RNA polymerase, and 
P90, an isogenic rpoC + control, suggested that RNA transcripts in this region were affected by 
the rpoC mutation. Expression of genes b2043 through b2062 was coordinately upregulated in 
the rpoC mutant (Figure 8). The elevated expression is most readily explained if a single 
transcript starts before b2062 and ends between b2043 and b2042 (galF). If this is true, the 

20 region upstream of b2062 should contain a promoter. That region was fused to the luxCDABE 
operon in Iux-a.pk033.g2. Transformants of P90 and 397C were grown at 30°C in LB medium 
containing 1 00 \ig/m\ ampicillin. Bioluminescence and turbidity, recorded with a Klett- 
Summerson colorimeter (Van Dyk et al. (1995) J. BacterioL 177:6001-6004), of actively 
growing cultures were used for calculation of light production per 1 0 9 cells. The unpaired t-test 

25 was used to compare quadruplicate bioluminescence measurements of strains carrying gene 

fusions to the control strain carrying the parental plasmid. The bioluminescence produced when 
the Iux-a.pk033.g2 gene fusion was placed into strain P90 was weak (0.56 +/- 0.06 RLU/10 9 
CFU), but yet was significantly greater (P<0.0001) than the bioluminescence produced by the 
parental plasmid in the same host strain (0.028 +/- 0.010 RLU/1 0 9 CFU). These data are 

30 consistent with a promoter in the region upstream of b2062 that is not very active in strain P90 
growing in LB medium. The bioluminescence of this gene fusion was elevated 1500-fold when 
placed in strain 397C (Figure 8). This strong promoter activity driving luxCDABE gene 
expression is, therefore, dependent upon the rpoC" mutation. In contrast, several other gene 
fusions to chromosomal DNA segments in this region, whether they contained predicted 

35 promoter regions or not, had very low levels of activity in both the wild type and rpoC~ mutant 
(Figure 8). Thus, the data from both the DNA microarray experiments and from gene fusions 
are consistent with cotranscription of the twenty genes in this region. The end of the operon was 
defined by a gene fusion to the galF upstream region (in Iiix-a.pk07.dl2) that strongly drove 
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luxCDABE transcription. The activity of this promoter region was not ^dramatically effected by 
the rpoC mutation (Figure 8), in agreement with the DNA array data. 

EXAMPLE 5 
Highly Parallel Transcription Analysis Using 
5 A High Density Array of Cellular Reporters 

Proof of principle with Luxarray 0.5 

A solid phase assay system consisting of reporter clones growing in the presence or 
absence of a perturbation, biological, environmental or chemical was developed. This was 
accomplished by growing the reporters on a porous membrane (Biodyne B, Nunc) seated on 
• 10 top of solid growth media in a culture dish (OmniTray, Nunc). Luminescence was measured 
either by a lumihometer or by creating an image of the entire culture and quantitating the 
pixel density. Effects of perturbants can be determined by comparison of luminescence 
generated by treated and control cultures. Growth on the surface of the membrane allows the 
reporter assay to be moved between conditions as required. Experimental protocols often 

15 involve perturbations that prohibit long term exposure due to cell death or other irreversible 
effects. The ability to move the entire array to new growth conditions allows one a great 
variety of experimental schemes such as pulsed or pulse/chase exposures, reversibility, and 
short term kinetic studies. 

Several important characteristics of the assay system needed to be evaluated. In 

20 particular, growth density and conditions, sensitivity, reproducibility, and the ability to 

perturb the reporters and detect changes were major focus points. Development of this assay 
system was initiated using a collection of 10 well characterized clones, the parental plasmid 
clone, and media alone (Table 14). The set of 10 clones represented a distribution of signal 
strength as well as response to perturbation with DNA damaging agents, in particular 

25 nalidixic acid. Initial experiments were designed to benefit from the sensitivity and 
simplicity of measuring luminescence with a 96-well luminometer (Dynex ML3000). 
Printing was accomplished using a BioMek 2000 (Beckman Coulter) equipped with a High 
Density Replication Tool (HDRT). Sterilization in between transfers was accomplished by 
soaking the pins successively in 0.2% SDS in water, sterile water, and 70% ethanol. After 

30 sterilization, the pins are air-dried prior to the next transfer. 
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TABLE 14 
Luxarray Q.5 Clone Description 



Strain 


Clone 
No. 


Fusion to: 


Bioluminescence* 


Comment 


DFD2083 


• TvT A 

N.A. 




none 


Parental plasrmd without insert 


DPD2282 


65.64 


lacZ 


moderate 


"Constitutive" expression in LB/low 
expression in LB+glucose 


DPD2247 


85. a5 


dinF 


low 


SOS regulon/Nahdixic acid and 
Mitomycin C inducible 


DPD2248 


24.f5 


dinG 


moderate 


SOS regulon/Nalidixic acid and 
Mitomycin C inducible 


DPD2249 


55-.a3 


dinP 


moderate 


SOS regulon/Nalidixic acid and 
Mitomycin C inducible 


DPD2250 


22.d4 


recA 


high 


SOS regulon/Nalidixic acid and 
Mitomycin C inducible 


DPD2253 


01. b6 


uvrA 


moderate 


SOS regulon/Nalidixic acid and 
Mitomycin C inducible 


DPD2242 


21.gll 


Ion 


moderate 


Heat shock regulon 


DPD2243 


54x3 


clpB 


moderate 


Heat shock regulon 


DPD2245 


42.c8 


rpoD 


low 


Heat shock regulon 


DPD2084 


06.b4 


yciG 


low 


SigmaS-dependent stress responsive 


DPD2090 


23. c7 


osmY 


low 


SigmaS-dependent stress responsive 



*Bioluminescence is measured in the liquid culture by lurninometer. 



Strains were grown overnight at 37 °C in 40 jllI LB in 96-well dishes. These cultures 
were designated "printing plates" and were used as the cell source to manufacture the arrays. 

Membranes were sterilized with UV illumination for 10 min then placed in contact 
with prewarmed media in a culture dish. The clones and controls were printed onto an 
approximately 8x12 cm membrane mimicking the pattern of a 96-well plate (Figure 3 A). 
Strains were printed to generate 8 repeating sections of the 10 strains and two controls and 
placed in a Dynex ML3000 luminometer prewarmed to 37 °C. Luminescence data was 
collected for each spot for 40 cycles of 20 min duration (overnight). Figure 3B shows the 
signal collected for each clone as a function of time. Figure 3C shows signal collected from 
the replicate spots of the same clone (recA-luxCDABE). The data clearly indicated that the 
strains were well behaved in the new solid phase system. Without exception, the signal 
strength measured corroborated the data obtained in liquid culture. Additionally, replicates 
varied minimally, again similar to liquid culture measurements. 

Next the system was evaluated for its ability to determine responses to perturbation, 
in this case, DNA damage caused by nalidixic acid. Parental clone and two nonresponding 
reporters, osmY, and lacZYA, and three DNA damage responsive reporters, uvrA, recA, dinG, 
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were chosen. Strains were printed onto duplicate membranes over LB agar from fresh 
overnight cultures as described and incubated for 6hr at 37°C to allow the cells to enter an 
exponential growth phase. The membranes were moved to new, pre-warmed plates 
containing varying amounts of nalidixic acid and light readings collected with an ML3000 
5 luminometer. Luxarray 0.5 clones (Figure 4A and B) were printed onto membranes and 
grown initially on LB then moved to plates containing 0 jug/ml (diamonds), 1 fig/ml 
(squares), or 5 jj.g/ml nalidixic acid (triangles). Luminescence was measured with a Dynex 
ML3000 luminometer every 30 min for 2 hrs. As shown in Figure 4B, the expected nalidixic 
acid mediated upregulation of genes in the SOS regulon was detected as increased 

10 bioluminescence. In contrast, strains containing fusions to non-DNA damage responsive 
genes, osmYand lacZYA, were unaffected by nalidixic acid treatment (Figure 4A). The 
strain containing the parental plasmid, pDEW201, without a promoter driving luxCDABE 
expression produces very low levels of bioluminescence that are very close to the 
background measured on a ML3000 luminometer for strains grown in 96-well microplates. 

15 The apparent basal bioluminescence and upregulation of this strain (Figure 4A) is likely due 
to cross-talk from an adjacent strain containing a DNA damage responsive gene fusion. 
High Density Luxarray 0.5 

The clone set of Luxarray 0.5 was used to develop a highly parallel solid phase assay 
by printing the clones at high density on the approximately 8x12 cm membrane. These 

20 arrays were used to further develop the system. 
Printing Density 

In an effort to more closely approximate the final assay system, the clones of 
Luxarray 0.5 were used to determine the maximum density one could print the cells at and 
carry out useful analyses. Arrays were printed as above at increasing density and grown for 

25 8 In- at 37 °C. Luminescence was imaged using an EagleEye II (Stratagene, La Jolla, CA). 
From visual inspection individual clonal areas of growth could be resolved at densities up to 
6144 individual spots per membrane. From a comparison of the size of the growth areas 
from different densities, it appeared that in the 8 hr growth period, there was a nutrient-limit 
effect that resulted in an inversely proportional amount of growth as density increases. At 

30 this density, there is over two-fold greater then the estimated number of transcription units in 
the entire E. coli genome. Thus a single 8x12 cm array could represent the entire E. coli 
genome in duplicate with capacity left for controls. Alternatively full genome coverage of 
two different strains or different species could be printed on the same array. 
High Density Array Image Collection 

35 The essence of this assay is to collect an image of the signal generated from reporter 

constructs such that the signal intensity can be subsequently quantified. This requires not 
only that the collection parameters (focal plane, magnification, integration time, and 
algorithm) are constant but also that the downstream image analysis software has the ability 
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to process the images generated. The most common application of this assay, chemical 
perturbation, requires physically relocating a membrane from one culture plate to another. 
This results in images with minimal X-Y positional registration. Several commercially 
available products can efficiently process these images. Array Vision™ (Imaging Research, 
5 Toronto, Canada) and ImageQuant (Molecular Dynamics, Sunnyvale, CA) are two examples 
of appropriate software packages. 

High Density Luxarray 0.5 Nalidixic Acid Perturbation 

The clones of Luxarray 0.5 were used to print arrays as described above at a 4x4 
density. That is to say that each single spot as in Figure 3 was replicated 1 6X in a 4x4 

10 subarray* This resulted in each clone being printed 128 times on different areas of the array. 
Arrays were printed in triplicate from fresh overnight cultures onto membranes on plates 
containing LB media. After 6 hr of growth at 37°C membranes were moved to prewarmed 
plates containing either LB media or LB media supplemented with 5 ^g/ml nalidixic acid 
(NA) previously demonstrated to cause detectable induction of responsive promoters for a 

15 wide range of promoter strength (Figure 4). Cultures were replaced at 37 °C to continue 

growing. Images were collected for each array every two hours from 0-8 hr after relocation 
using a cooled CCD camera (FluorChem 8000: f0.85 lens, 2min exposure. Alphalnnotech). 
Spot intensity was determined using ArrayVision™ (Imaging Research, Toronto, Canada). 
Figure 5 shows the results for selected strains containing reporter gene fusions. 

20 As found with the low density experiments quantified with the luminometer, the 

expected responses for each clone were well demonstrated. The five documented DNA 
damage-responsive reporter constructs clearly show an upregulation of expression. In 
contrast, light production from the strain carrying the lac promoter fusion as well as several 
strains carrying other promoter fusions was decreased. This decreased bioluminescence 

25 likely reflected the decreased growth and metabolism of the nalidixic acid treated strains and 
demonstrates the specificity of the upregulation of the SOS-regulon gene fusions. 
Furthermore, the signal from the strain containing the parental plasmid is of dramatically 
lower magnitude than that of strains with promoter-Zax fusions, thus, demonstrating the 
advantage of the cooled CCD camera for data capture. 

30 Lux Array 1.0, a highly parallel promoter activity assay. 

Selection of a maximal non-redundant set of lux gene fusions. 
The genome-registered Lux-A luxCDABE gene fusion collection (described in 
Example 1) provided a list of 4988 plasmid-borne gene fusions, each with boundary information 
relative to the E. coli genome and the orientation relative to the lux operon. An operably linked 

35 or a functional construct was defined as one consisting of a genomic fragment encompassing a 
promoter adjacent to the promoterless lux operon in an orientation that causes transcription 
initiated at the promoter to proceed into and through the lux operon. Therefore, to identify the 
functional subset of the collection, criteria were computationally applied to filter the list of 
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clones first for functionality and second for redundancy. A list of definitions of documented and 
predicted operons (Thieffry et al. (1998) Bioinformatics 14:391-400) was used to define 
genomic coordinates of the operons as the translational start codon position of the first open 
reading frame (ORF) in the operon and the translational stop codon position of the last ORF in 
5 the operon. Additional information included the strand on which the operon is coded (direction 
of transcription), and gene names (common or "b" number). The lux gene fusions were filtered 
computationally using the following assumptions; (i) a functional transcriptional fusion would 
result from any genomic fragment starting greater than 50 base pairs upstream of the start codon 
of the first ORF and ending anywhere between the start codon of the first ORF in the operon and 

10 the stop codon of the last ORF in the operon, thereby eliminating the occurrence of a 

transcriptional stop signal in the construct between the promoter and the lux operon; and (ii) the 
promoter contained in the genomic fragment must face in the correct orientation relative to the 
lux operon (pictorially represented in Figure 6). Finally, in cases when more then one clone fit 
the criteria for a single operon, only the one construct containing the genomic fragment 

15 representing the greatest amount of upstream sequence was retained thereby eliminating 

redundancy. The PERL code used and the resulting list of 689 selected gene fusions are found 
in Scheme 1 and Table 1 8 (following Example 7). These fusions represent 27 % of the 2584 
known and predicted transcriptional units in the E. coli genome. Individual cultures of strains 
containing the identified gene fusions were rearrayed from the ninety original culture plates to 

20 create a set of sixteen 96-well microplates containing all the identified fusions, duplicated with 
side by side symmetry, including appropriately placed controls. These 16 plates represent were 
used to generate the cellular arrays for use in subsequent analyses. 
Preparation of reporter array. 

The E. coli strains were grown overnight at 37°C in 40 ul LB medium supplemented 
25 with 100 u,g/ml ampicillin in a set of sixteen 96-well dishes. These cultures were designated 

"printing plates" and were used as the cell source to manufacture the arrays. Porous membranes 
(Biodyne B, Nunc) were sterilized with UV illumination for lOmin then placed in contact with 
pre-warmed of solid LB growth media in a culture dish (OmniTray, Nunc). Printing of 4 X 4 
subarrays was accomplished using a BioMek 2000 (Beckman Coulter) equipped with a High 
30 Density Replication Tool (HDRT). Sterilization in between transfers was accomplished by 
soaking the pins successively in 0.2% SDS in water, sterile water, and 70% ethanol. After 
sterilization, the pins are air-dried prior to the next transfer. The E. coli strains in the LuxArray 
were printed onto an approximately 8x12 cm membrane in two sets of triplicates. 
Growth in the array. 

35 The growth rate of individual colonies was evaluated because initial experiments clearly 

demonstrated a large distribution of growth rates for the reporter strains in the system. In order 
to differentiate between clone- dependent or system-dependent sources of this variability, 
bioluminescent cellular arrays were generated in triplicate on different days using LB agar media 

51 




WO 01/79419 PCT/US01/1 1855 

containing 10 |j.g/ml tetrazolium blue. The product generated when live cells reduce tetrazolium 
blue is an insoluble blue precipitate. This greatly increased contrast between the cells and the 
media simplifying direct imaging of the cells by normal light. For each of the triplicate 
experiments, each clone was visually scored to determine the size of the growth generated during 
5 8 hrs of incubation at 37 C (data not shown). Variability was clearly clone-specific and very 

consistent from day to day. As the majority of proposed analyses are relative measurements, and 
inter-clone comparisons are unlikely, this type of growth variability does not effect the 
applicability or robustness of the overall assay system. No further attempts were made to 
determine the source of the variability, however it can be assumed that it is a result of the 
10 plasmid constructs carried by the clones. 

Bioluminescence ofLuxArrav 1.0. 

Images of the bioluminescence were using a cooled CCD camera (FluorChem 8000: 
fO.85 lens, 2 min exposure, Alphalrmotech) without additional light source. The 
bioluminescence from one such array grown for 16 hours on rich media is shown in Figure 7. 

15 The array in Figure 7 shows side by side replicate subarrays that include control strains, a strain 
with lacZYA promoter fusion and a strain containing the parental plasmid (pDEW201). 
Identification of Nalidixic Acid Responsive Gene Fusions using LuxArrav 1.0. 

The antibiotic nalidixic acid, an inhibitor of DNA gyrase known to be a effective inducer 
of the SOS DNA damage stress response was used to demonstrate the utility of this array. 

20 Arrays were printed from fresh overnight cultures onto membranes on plates containing LB 

media. After 6 hr of growth at 37°C membranes were moved to pre- warmed plates containing 
either LB media or LB media supplemented with 5 ug/ml nalidixic acid then replaced at 37°C to 
continue growing. Images were collected for each array every two hours from 0-8 hr after 
relocation using a cooled CCD camera (FluorChem 8000: fO.85 lens, 2 min exposure, 

25 Alphalrmotech). 

Spot intensity of each image was determined using Array Vision™ (Imaging Research, 
Toronto, Canada) and the resultant pixel density measurements imported into a template with 
identifiers for each spot. The average signal for each of the triplicate spots was calculated. The 
background signal, which results from cross illumination of neighboring spots, was calculated 

30 by finding the median of the 24 spots containing a strain with the parental plasmid on each of the 
triplicate arrays and the calculating the average of the three medians. This background signal 
was calculated at each of the time points and subtracted from each measurement at the 
corresponding time point. All negative numbers were converted to zero. 

Data normalization to account for inhibition of growth by nalidixic acid was 

35 accomplished by finding the sum of the averaged signals of each spot in the array for each 
treatment at each time point. A normalization factor (NF) was calculated as follows: 

NF^Total array signal (time zero, LB control)/Total array signal (time x, condition y) 
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Each measurement was multiplied by NF to yield a normalized signal. Ratios the nalidixic acid 
treated spot to the corresponding control spot were calculated using the normalized data. 

The ratios of the normalized data were compared for each of the duplicate spots resulting 
from independent cultures in the array. Putative nalidixic acid upregulated gene fusions were 
5 selected as those for which the ratio in both duplicate spots was at least 2 at both the 2 hour and 
4 hour time points. Twelve gene fusions were selected by these criteria (Table 15). 

These twelve gene fusions include three well-characterized members of the SOS regulon 
as well as several fusions to promoters not previously known to be upregulated by nalidixic acid 
treatment. The DNA sequence of plasmid DNA isolated from each of these twelve cultures 
10 reconfirmed the identity of each inserted DNA. The LuxArray contains a total of six gene 

fusions to SOS regulon operons, all of which would be expected to be upregulated by nalidixic 
acid. Three were, thus, scored falsely as negatives. An examination of the data showed that two 
of these false negatives were reproducibly upregulated but not at level of the selection criteria; a 
fusion to uvrD was upregulated by 1 .6 and 1 .9 fold at 4 hours of nalidixic acid treatment, while 
15 one to ruvA was upregulated 1 .7 and 2.1 fold at that time point. The third did not have 

consistent responses; the ratio of nalidixic acid treated to untreated for the dinFAvx fusion was 
found to be 4.2 in one of duplicate spots and 0.7 in the other. 

Validation of nalidixic acid upregulated gene fusions by retesting in liquid medium. 
Each of the newly identified putative nalidixic acid upregulated gene fusions and two of 
20 the known SOS gene fusions were retested using exponentially growing cultures in liquid 
medium with seven concentrations of nalidixic acid (80 fig/ml in two fold dilutions to 
1 .2 jag/ml). Several concentrations of nalidixic acid were used because the differences in 
responses between liquid and solid growth were not known. Light production of 100 \x\ 
duplicate cultures at 37°C was quantitated using a 96 well plate luminometer (Luminoskan 
25 Ascent, Labsystems). Table 15 shows the results expressed as ratios of the signal from the 

nalidixic acid treated cultures to the untreated control at 2 hours at the concentration that yielded 
the maximal response. Also shown is the number of concentrations of nalidixic acid tested that 
resulted in response ratios of 1 .5 or greater. It should be noted that the liquid medium tests were 
not corrected for growth inhibition by nalidixic acid. Using a standard of a maximal response 
30 ratio of at least 1.8 and responses ratios that were >1.5 fold at 3 or more concentrations, 7 of the 
9 putative novel nalidixic acid upregulated gene fusions were shown to be reproduced in liquid 
medium. 

Mitomycin C responses and effect of lexAind mutation . 

Mitomycin C, a DNA damaging compound with a different mechanism of action from 
35 nalidixic acid, was used to determine if these newly discovered nalidixic acid upregulated gene 
fusions were generally responsive to DNA damage. In addition, the effect of a lexAind mutation 
was tested to determine if any of these were part of the SOS regulon. The expectation is that 
SOS regulon member will be induced by both nalidixic acid and mitomycin C in a manner that 
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10 



is dependent on lexA function (Walker, G. C, (1996) Escherichia coli and Salmonella-. Cellular 
and Molecular Biology ASM Press ). As shown in Figure 9, a fusion of the promoter region of 
bl728 to luxCDABE was clearly induced by both nalidixic acid and mitomycin C in the lexA + 
host, but was not induced by these chemicals in the lexAind host strain. Thus, these results 
demonstrate that upregulation by nalidixic acid as well as mitomycin C is controlled by LexA. 
This is consistent with the observation of upregulation of the M728 mRNA transcript upon 
mitomycin C treatment in a LexA dependent fashion (Fernandez de Henestrosa et al. (2000). 
Mol Microbiol. 35:1560-1572). - Likewise, two gene fusions, those to oraA and >7gN were 
identified as new members of the SOS regulon (Table 16). Interestingly, four of the nalidixic 
acid responsive gene fusions were not upregulated by mitomycin C in the lexA + host suggesting 
that they are not generally DNA damage responsive, but rather are more specifically responsive 
to nalidixic acid. Negative, non-DNA damage responsive gene fusions were also included 



Thus, the LuxArray assay was useful to identify novel nalidixic acid upregulated genes in 
E. coli. Likewise, this robust assay can be used generally in a fashion parallel to hybridization 
assays to monitor transcriptional changes. The multiple whole genome scale capacity and 
relative simplicity of manufacture allow for significant throughput. This assay is an important 
addition to efforts making functional assignment of promoter activity. 



(Table 16). 
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Table 16 

Mitomycin C and Nalidixic acid Responses in IexA+ and lexAind hosts 



Fusion to gene or 
operon: 



Lux ID 



Host NA 
Strain Ratio, 
2hr 
lOug/ml 



MitC Ratio, 2hr 
250ng/ml 



Known SOS regulon member: 



dinG ybiB 


lux- lexA 4 " 


4.03 


5.01 




a.pk0024.f5 








IexA ind 


0.97 


LIS 


Nalidixic acid upregulated on both the solid LuxArray and in 




liquid culture 








New SOS regulon members 






bl728 


Iux-a.pk033.c5 lexA + 


5.17 


8.97 




Iux-a.pk033.c5 lexA ind 


0.40 


0.90 


oraA 


Iux-a.pk058.f5 lexA + 


11.06 






Iux-a.pk058.f5 lexA ind 


1.06 


1.21 


yigN 


lux- lexA + 


5.74 


9.84 




a.pk046.ni 








lux- lexA ind 


0.59 


0.93 




a.pk046.fll 






Nalidixic acid upregulated, not generally DNA damage 




inducible 








bll69 


lux- lexA + 


3.22 


1.12 




a.pk0015.d6 








lux- lexA ind 


2.29 


1.73 




a.pk0015.d6 






bl936 


lux- lexA + 


3.25 


1.22 




a.pk0019.gl 








lux- lexA ind 


1.55 


2.46 




a.pk0019.gl 






IpxA lpxB rnhB 


lux-a.pk06 1 .c3 lexA + 


2.87 


1.06 


dnaE 










Iux-a.pk061.c3 lexA ind 


2.39 


1.29 


yaaF 


Iux-a.pk031.e7 lexA + 


1.53 


1.02 




Iux-a.pk031.e7 lexA^ 


1.19 


0.99 
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Negative controls 



yciG 


Iux-a.pk006.b4 lexA + 


0.32 


0.56 




lexA ind 


0.67 


0.61 


lacZ 


65.d4 lexA + 


0.81 


0.80 




lexA ind 


1.05 


1.24 



EXAMPLE 6 

Additions to collections and alternative methods of generating 
collections of bacterial genomic fragments fused to a reporter 
5 There are several methods for making fusions of bacterial promoter regions to 

reporter genes to add to a genome-registered collection of randomly generated gene fusions. 
Some of these methods are also alternative methods for building a large collection of gene 
fusions. 

Firstly, DNA sequence data from more random gene fusions will result in 
10 identification of additional useful members to a collection that is not completely saturated. 
These additional sequences can be generated from members of the same originally 
sequenced library of genetic fusions, LuxA for instance, or an independently generated 
library. The steps outlined in previous examples allow the genome registration of the newly 
sequenced fusions. 

15 DNA sequencing of randomly generated fusions will lead to identification of fusions 

that would be useful except that the orientation of the chromosomal DNA is inverted, such 
that the promoter regions of interest are not operably linked to the reporter genes. Selecting 
such fusions and" inverting the orientation of the insert DNA can significantly enhance the 
utility of a sequenced collection by adding many more operable linked fusion to the 

20 collections. A simple way to do this is to digest the plasmid DNA with a restriction enzyme 
that cuts just outside the cloned region and religate the pieces. Although a mixture of 
plasmids results from this procedure, in many cases the correctly oriented plasmid can be 
found because cells containing it, but not other possible products, will produce light . This 
method has the advantage of avoiding use of the polymerase chain reaction and thus 

25 avoiding possible changes in the DNA sequence from amplification. 

This approach was demonstrated by identifying a non-operably linked clpB fusion in 
the Lux-A Collection, inverting the chromosomal DNA segment relative to the vector, and 
comparing an E. coli strain carrying the resultant plasmid to one in the collection with a 
proper orientation of the clpB promoter region. One isolate from the Lux-A Collection, 

30 Iux-a.pk043.d3 contains the region of the E. coli chromosome with the clpB promoter, but is 
oriented in the opposite direction than required to operably link the promoter and reporter 
genes. Very low levels of light production of this strain were measured. In contrast, the 
strain, Iux-a,pk054.c3, that carries the plasmid with the clpB promoter region in the proper 
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orientation to operably link it to the luxCDABE reporters gene had a high level of light 
production. Plasmid DNA was isolated from Iux-a.pk043.d3 y digested with restriction 
enzyme Xmal, and ligated with T4 DNA ligase. The ligation reaction was used to transform 
E. colt strain DPD1675 by electroporation. Transformants were selected by ampicillin 
5 resistance. Three percent of the resultant transformed colonies produced bioluminescence as 
detected by exposure of X-ray film. The response to ethanol treatment of two of these light 
producing transformants that putatively contain inverted chromosomal inserts, of 
Iux~a.pk054.c3 (DPD2243) that contains a plasmid with the clpB promoter operably linked to 
the luxCDABE reporter and of the original Iux-a.pk043.d3 were compared. Actively growing 
10 cultures at 37°C in LB medium were treated with 4% ethanol and light production was 

quantitated at 37°C using a microplate luminometer (Dynex ML3000). Table 1 7 summarizes 
the average bioluminescence of duplicate samples at 38 min after treatment. 

TABLE 17 



Light production by strains containing a clpB-luxCDABE gene fusion resulting 
from inversion of the chromosomal insert, and controls 



Strain 


Culture 
turbidity' 


Light production 
in LB 2 


Light production in 
LB + 4% ethanol 2 


Response 
Ratio 


Ivoc-a.pk043.d3 


11 


0.0015 


0.0025 


N/A 


Inverted 
Itcx-a.pk043.d3 


14 


1.25 


8.17 


6.5 


Inverted 
Ivcx.-a.pk043.d3 


18 


5.34 


29.63 


5.5 


Iicx-a.pk054.c3 
(DPD2243) 


12 


1.37 


13.00 


9.4 



1 : At time zero; Klett units 
15 2 : RLU 

Thus, the presence of a promoter that drives transcription of the luxCDABE operon in 
the inverted isolates was shown by the light production in LB medium. Furthermore, the 
expected biological response of induction of up-regulation by ethanol treatment for this 
20 member of the heat shock regulon was also demonstrated. 

This inverting method was implemented in a parallel manner to add gene fusions to 
the Lux-A Collection. First, a list of about 400 gene fusions from the Lux-A Collection that 
contained the correct genomic fragment but in the wrong orientation relative to the lux 
operon, and were not present in the current collection was derived. Each of the strains 
25 containing these fusions was picked out of the original collection of 8000 fusions and 

regrown. The bioluminescence of each was quantitated because, in some cases, the" presence 
of a divergent promoter in the cloned DNA fragment resulted in light production even 
though the DNA was inverted with respect to the promoter of interest. About half of the 
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selected strains produced > 0.1 RLU of light following 3 hours of incubations at 37 °C after 
inoculation of 100 ulLB medium with 10 ul of culture from a working plate. This 
suggested that the set of strains with light production of < 0.1 RLU would be easier to pursue 
initially. Thus, following isolation of plasmid DNA in a 96 well format, Xmal digestion, and 
religation, the ligation reaction mixes originating from strains with light production of less 
than 0.1 RLU were sorted into separate plates from those that originated from strains with 
light production greater than or equal to 0.1 RLU. E. coli DPD1675 was made competent by 
calcium chloride treatment and transformed with the religated plasmid DNA selecting for 
ampicillin resistance. Placing the petri plate with the transformant colonies on X-ray film 
and developing the exposed film at various time points of contact identified light producing 
transformants. The light producing colonies were purified by isolating single colonies in 
solidified LB medium containing ampicillin and light production was verified by . 
luminometry. This process was completed for 36 plasmids isolated from strains with light 
production of less than 0.1 RLU. Plasmid DNA from each of the strains containing putative 
inverted chromosomal segments was isolated and DNA sequence was determined to assess if 
the inverted product had been obtained. Thirteen of the 36 plasmids yielded DNA sequence 
information that had a high scoring hit when compared with the E. coli genomic DNA 
sequence using BLASTN (default setting). Of these, five had DNA in the inverted 
orientation from the original clone. 

There are several potential reasons for this relatively low success rate. For example, 
some promoters that are annotated may not be functional in the conditions used or may not 
be biologically relevant promoters. Thus, selection by light production of the inverted clone 
would not be successful. In contrast, divergent promoters that were not active in the liquid 
culture test for bioluminescence used for sorting may have been active oh the solidified 
medium used to screen for light producing clones. This may have lead to selection of a 
colony containing the initial plasmid. Furthermore, there may have been poor plasmid DNA 
yield, poor cutting, and/or poor religations when done in parallel in microplates. This may 
have lead to DNA concentrations in the ligations such that intermolecular ligations were 
minimized or such that a low number of transformant colonies resulted and rare 
transformants with the inverted insert DNA were not represented. Thus, this approach of 
restriction digestion, religation, transformation and selection of light producing 
transformants has been demonstrated to work. However, further optimization for efficient 
parallel implementation is required. 

To test inverting of a chromosomal segment in a plasmid containing divergent 
promoters, the backward fusion to yebF was chosen. Obtaining an operably linked reporter 
gene fusion to yebF was of particular interest because XheyebF gene is an otherwise 
unknown open reading frame that was found in a DNA microarray experiment to be highly 
induced by mitomycin C, a DNA damaging agent. In this case, a divergent promoter to the 
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purT gene was present in the chromosomal fragment and strains containing the backward 
yebF fusion produced light. Following Xmal digestion, religation and transformation, 10% 
of the transformants produced light. Of these, 20% (or 2% of the initial transformants) were 
found to be highly induced by nalidixic acid, another DNA damaging agent. This result 
5 strongly suggesting that the desired inversion of the chromosomal segment had occurred. 
This was verified by DNA sequence analysis. Thus, inverting chromosomal segments 
containing divergent promoters is possible and is facilitated if the two promoters can be 
distinguished by their biological activity. 

An alternative inverting procedure uses plasmid DNA from clones requiring 

10 reorientation as templates for the polymerase chain reaction (PGR). Universal PCR primers 
were designed Which hybridize specifically with the pDEW201 plasmid flanking the 
genomic fragment cloning site (Bam HI). Use of the first primer, pDEWE2S (SEQ ID NO:3, 
S'-GGAATTGGGGATCGGAGCTCCCGGG-S'), an EcoRl site (GAATTC) is converted to a 
SacI site (GAGCTC) via an internal AT to GC mismatch. Use of the second primer, 

15 pDEWS2E (SEQ ID NO:4, 5'-GAATGGCGCGAATTCGGTACCCGGG-3'), results in the 
conversion of the SacI site to an EcoRl site via an internal GC to AT conversion. Thus the 
resultant PCR products from any pDEW201 clone can be digested with EcoRl and SacI and 
Iigated into EcoRl- and Saci-digested pDEW20L The resultant plasmid contains the original 
chromosomal segment in the opposite orientation of the original clone, relative to the lux 

20 operon. As the primers are specific to the vector they can be used for all clones and are 
amenable to high throughput (96-well plate) approaches. 

Construction, sequencing and registering an additional random library of E. coli genomic 
fragments fused to a luxCDABE reporter . 

An independent random library of E. coli genomic fragments in plasmid pDEW201 

25 was constructed and called the LuxZ library. Chromosomal DNA isolated from E. coli strain 
W31 10 (Ernsting et aL, (1992) J. Bacteriol. 174:1 109-1 1 18) was partially digested with the 
restriction enzyme Sau3 Al, size fractionated by agarose gel electrophoreses, and a fraction 
with an average size of approximately 0.7 KB isolated. This fraction was Iigated to 
pDEW201 that had previously been digested with BamHl and treated with calf intestinal 

30 alkaline phosphatase. The ligation products were used to transform ultracompetent E. coli 
XL2Blue cells (Stratagene) to ampicillin resistance using the protocol provided by 
Stratagene. Preliminary characterization of individual XL2Blue transformants that were 
picked in random indicated that a large percentage (16 of 16) contained insert DNA with 
sizes ranging from 0.1 to 1 .5 KB. Approximately 150,000 of these transformants were 

35 pooled and used as a source of heterogeneous LuxZ library plasmid DNA isolated using 
Qiagen tip20 columns (Qiagen Corp). This plasmid DNA pool was diluted 100-fold then 
used to transform E. coli DPD1675 (Van Dyk and LaRossa (1998) Methods in Molecular 
Biology: Bioluminescence Methods and Protocols, Humana Press inc. vol. 102:85-95) by 
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electroporation. Electrocompetent DPD1675 cells were prepared starting with 1 liter of LB 
culture at approximately 2.4 x 10 8 cells/ml. Following chilling on ice, the cells from this 
culture were collected by centrifugation at ah average relative centrifugal force of 6,555 for 
5 minutes. The cells were washed twice ice-cold, sterile distilled water (1 liter for the first 
5 wash and 500 ml for the second wash) and centrifuged. Then the cells were washed with 
10 ml of ice-cold, sterile 10% glycerol in distilled water and centrifuged. The final 
resuspension of the cell pellet was done with ice-cold, sterile 10% glycerol in distilled water 
to yield a final volume of 2.1 ml. 55 ul aliquots were prepared, quick frozen in dry ice and 
ethanol, then stored at -80°C until.used. One such aliquot was thawed on ice, 1.0 \il of the 

10 diluted LuxZ library plasmid DNA was added, and the cells and plasmid DNA were 

transferred to an electroporation cuvette (Biorad, Gene Pulserd^/it. coli Pulser™) on ice. The 
capped cuvette was electroporated with 1 .85 volts at a capacitance of 25 up then 0.5 ml of 
SOC medium (per liter: 20 g tryptone, 5 g yeast extract, 0.5 g NaCl, 2.5 raM KC1, 2.5 mM 
MgCl 2 , 20 mM glucose, pH 7.0) was added. A 30 minute phenotypic expression time at 

15 37°C was used to minimize the presence of siblings. Following this incubation, 500 \xl of 
24% sterile glycerol in water was added and 1 00 ui aliquots were prepared, frozen on dry 
ice, and stored at -80°C until used. These aliquots were thawed, plated on LB medium 
containing ampicillin, and 4608 single colonies were picked for use as inoculum for cultures 
grown in 80 pil lx freezing medium (LB with 100 ng/ml ampicillin, 5.54% glycerol, 36 mM 

20 K 2 HP0 4 , 13.2mM KH 2 P0 4 , 1 .7mM NaC 6 H 5 0 7 :2H 2 0, 40mM MgS0 4 , 6.8mM NH 4 S0 4 ) in 
the wells of 384 well microplates. These plates were stored at -80°C until used for 
inoculation and growth to saturation in Terrific Broth (Gibco-BRL, Inc) containing 
100 fig/ml ampicillin in 96 well deep-well plates. Plasmid DNA was extracted from the 
cultures using the Qiagen R.E. A.L.™ prep method with the following modification: after 

25 lysis of cells, the plates were placed in a boiling water bath for 5 minutes and then rapidly 

chilled in an ice-water bath before precipitation with Buffer R3. This modification prevented 
degradation of the plasmid DNA by the nucleases present in the non-endA host strain, 
DPD1675. DNA sequencing reactions were performed with approximately 1 ug of plasmid 
DNA under standard ABI Prism™ dRhodamine DyeTerminator Ready Reaction conditions 

30 with the primers pDEW201 .forward (SEQ ID NO: 1, 5'-GGATCGGAATTCCCGGGGAT-3 ') 
and pDEW201. reverse (SEQ ID NO:2, 5'-CTGGCCGTTAATAATGAATG-3 ') to obtain 
sequence information from each end of the insert. DNA sequences were determined on 
ABI377™-XL 96-lane upgraded Sequencers under 4x run conditions on 5% PAG 
(polyacrylamide gel) LongRanger™ (FMC, Inc.) gels and analyzed with ABI software. 

35 DNA sequences were transferred to a UNIX based utility for further analysis. The homology 
search for the sequence from the beginning and end of each Lux-Z clone (in both 
orientations) and registration to the E. coli genome was done as described in Example 1 
resulting in 1799 additional genome-registered gene fusions. 
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EXAMPLE 7. 

Lux Array 1.04. a genome- wide, promoter activities assay in liquid medium, 
and its use to discover a limonene sensor. 
Selection of additional non-redundant lux gene fusions to expand LuxArrav 1.0. 
5 Operably linked gene fusions were selected from the genome-registered LuxZ gene 

fusion collection described in Example 6 using the computational filter described in 
Example 5. This list was then compared with the current list of promoters contained in Lux 
Array 1.0 to eliminate redundancy. This process resulted in identification of 149 gene 
fusions useful to expand the LuxArray. 

10 In addition, the five gene fusions generated by inversion of previously sequenced 

gene fusions as described in Example 6 and six gene fusions from other sources were added. 
Table 19 lists these additional 160 gene fusions that together with the gene fusions in 
LuxArray 1 .0 yielded LuxArray 1 .04. The 849 gene fusions in Lux Array 1 .04 thus 
represent 33% of the 2584 known and predicted transcriptional units in the E. coli genome. 

15 Individual cultures of strains containing the newly identified gene fusions were added to 
empty wells of the existing micropiate number 1 5 and new microplates numbered 1 7 to 1 9 
were made with duplicated side by side symmetry. The former micropiate 16 was eliminated 
because it contained only strains with the parental plasmid and sterile controls. Thus, the 
Lux Array 1 .04 is contained in 18 microplates representing the cellular array. These were 

20 stored at -80°C in freezing medium. 

Limonene stress of the reporter array in liquid culture . 

Microplates from -80°C containing the LuxArray 1.04 E. coli strains were thawed 
and used to inoculate 100 \\\ LB medium supplemented with 100 p.g/ml ampicillin in 
Costar#3595 flat bottomed 96-weIl microplates. Following overnight growth at 37°C, the 

25 plates were visually examined and any wells with little or no turbidity were recorded. Then 
these cultures were diluted by transfer of 15 ul to 135jal LB medium in Costar #3595, flat 
bottom 96-well microplates and were moderately swirl shaken on an IKA Schuttler MTS 4 
shaker at setting 400 for two hrs at 37°C in a humidified box. Twenty \xl of each these 
actively growing cultures were transferred to the corresponding well in sets of three 

30 microplates, one containing 80 ul of LB medium, another containing 80 ul of LB medium 
saturated with limonene at room temperature, and the third containing 40 \x\ of LB medium 
saturated with limonene and 40 ul of LB medium. An initial (zero time) reading was taken 
using an MLX micropiate luminometer (Dynex) pre-warmed to 37°C. Then the plates were 
placed at 37°C without shaking. Additional bioluminescent measurements were taken at 45, 

35 90, and 135 minutes. To limit the number of microplates handled each day, LuxArray 1 .04 
microplates 1 to 5, microplates 6 to 10, microplates 1 1 to 15, and microplates 17 to. 19 were 
used on each of four different days. 
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Data analysis of limonene induced responses. 

The data were normalized to correct for growth" inhibition caused by limonene on the 
basis of each individual day that data was collected. The bioluminescence (RLU) of each 
individual culture for each day of experimentation under each of the three conditions, were 
5 added to determine the daily array total RLU. These were used to find a daily normalization 
factor (DNF) as follows: 

DNF=Daily total array signal (time zero, LB control)/Daily total array signal (time x, 
condition y) 

Each measurement from the corresponding day, time point, and condition was 

10 multiplied by DNF to yield a normalized signal. The data from wells that were scored to 
have little or no* growth were not used in further analysis. The normalized data were 
analyzed with GeneSpring (Silicon Genetics) software, which averaged the signal from each 
of the duplicate cultures for each reporter gene fusion. 

Overall, there was little to no upregulation of gene expression under the condition of 

15 40% saturated limonene in LB medium. Thus, patterns of gene expression from the 80% 
saturated limonene in LB medium were further examined. Lists were made of gene fusions 
that were upregulated by 2 fold or more at each time point. These lists were examined and 
gene fusions with very low normalized RLU values (<0.012 RLU) at all conditions were 
eliminated. The upregulated gene fusions are listed in Table 20, which uses the name of the 

20 gene in each operon with the smallest b# to identify each gene fusion. The designation of 
">2X" is given for upregulation while "*" is listed if the expression was less than 2 fold 
increased. At 45 minutes, the expression of fourteen gene fusions was increased; none of 
these was increased 3 fold or higher. At 90 minutes, 37 gene fusions were induced; these 
included all but one of the gene fusions observed at 45 minutes. However, the most highly 

25 upregulated gene fusions at 90 minutes were not previously observed to be upregulated at 
45 minutes. These were fusions of the luxCDABE reporter to the promoter regions ofuhpT 
(3.8X), nirB (3.7X) and narK (3. 6X). At 135 minutes, 39 gene fusions were induced, which 
included 13 fusions not observed at 45 or 90 minutes. At this time point, the most highly 
upregulated gene fusions were those to the promoter regions of uhpT (7.7X), nirB (4. IX) and 

30 katG (4. IX). Thus, the gene fusion to the promoter region of uhpT was the most highly 
upregulated at both the 90 and 135 minute time points. 
Verification of uhpT-lux upregulation by limonene. 

E. coli strain DPD3228 is identical to Iux-a.pk034.b9 the strain containing the uhpT- 
luxCDABE gene fusion. This strain was grown overnight at 37°C in LB medium containing 

35 100 u.g/ml ampicillin and diluted the following day into LB medium and grown to log phase 
at 37°C. 20 pi aliquots of this culture were added to 80 pi of LB medium saturated with 
limonene, to 80 pi of a series of two fold dilutions into LB medium of LB saturated with 
limonene, and to 80 pi of LB medium. Light production was measured in a Luminoskan 
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Ascent (Labsystems) microplate luminometer at 37°C. Figure 10 shows the result. Similar 
to the initial Lux Array observations, a late response to the presence of limonene was 
observed. This response was maximal at the highest concentration of limonene tested, but 
was also detected when limonene was present at 40% or 20% of saturating amounts in LB 
medium. Thus, the upregulation of bioluminescence from this strain by limonene was" 
confirmed, demonstrating its utility as a sensor for limonene. 



65 



WO 0 1 /794 1 9 PCT/US0 1 /1 1 855 



Scheme 1 : PERL script used to filter the lux clone collection for functional reporter 
constructs 

# Filename : luxf ilter2 

#This perl script is designed to compare an operon list 
^containing genome coordinates and directionality 

#and clone ID to a list of clones containing the same and output 
#their intersect to a new file 

open (OPERONS, "< operons_gg.txt") || die "can't open operon list: 

open (OUTPUT, "> full_list2.txt") || die "can't open output file: 
$ ! " ; 

open (CLONES, "< luxclones.txt") U die "can't open clone list: •$!"; 
©clones = <CLONES>; 
close CLONES; 

while ($line = <OPERONS>) { 
print OUTPUT $line; 
print $line; 

if ($line -~ /complement/) { 

($start) - $line »~ /\d+\ . \ . ( \d+ ) / ; 
($op_end) - $line =~ / ( \d+) \ . \ Ad+/ ; 
$promoter = $start+50; 
foreach $clone (©clones) { 

if ($clone =~ /\t-\t/) { 

($clone_start ) = $clone =~ 

/\d+\t(\d+J \t\d+/; 

($clone_end) = $clone =~ 
/(\d+) \t\d+\t\d+/ ; 
if { ($clone__start > ^promoter) && 
($clone_end > $op_end) && 

($clone_end < $start)){ 
print OUTPUT $clone; 
print $clone; 

) 

) 

} 

} else { 

($start) = $line =~ / ( \d+ ) \ . \ . \d+/ ; 
($op_end) - $line — /\d+\ . \ . ( \d+ ) / ; 
$promoter = $start-50; 
foreach $clone { ©clones) { 
if ($clone =~ /\+/) { 

($clone_start ) = $clone =~ 

/ (\d+)\t\d+\t\d+/; 

($clone_end) = $ clone =~ 
/\d+\t(\d+)\t\d+/.; 
if ( ($promoter > $clone_start ) && 
($clone_end < $op_end) && 

($clone_end > $start) ) { 
print OUTPUT $clone; 
print $clone; 

} 

} 

} 

> 

} 

close OUTPUT; 
close OPERONS; 
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Table 18: Luxarrav 1.0 Clone Collection 



Lux ID 



Coding Operon 
strand 



lux-lacZ 
Iux-a.pk007.b9 

Iux-a.pk0017.h6 
Iux-a.pk0022.c6 

Iux-a.pk0026.g3 
Iux-a.pk0032.g2 



lux- 

a.pk0037.dl 1 
lux- 

a.pk0041.fl0 
Iux-a.pk047.dl0 



Iux-a.pk052.f3 
Iux-a.pk058.f5 
Iux-a.pk066.a3 
Iux-a.pk072.e2 
Iux-a.pk078.c5 



complement(360473.. 365529) 

Operon complement(3399029..3400969) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yhdA" 

+ Operon (510865..513092) /note = "predicted operon" /note 
= "ordered genes contained in the operon: b0485 b0486 " 

Operon complement(23253 87..23347 1 2) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b2225 b2226 b2227 b2228 b2229 yfaA " 

Operon complement(120178.. 121551) /note - "documented 
aroP operon" 

+ Operon (4161218..4171626) /note = "predicted operon" 
/note = "ordered genes contained in the operon: btuB murl 
equtB birA " 

Operon complement^ 067734.. 1 073234) /note = "predicted 
operon" /note — "ordered genes contained in the operon: 
bl006bl007bl008bl009bl010bl011 bl012 " 
+ Operon (342 1 08..343 1 57) /note = "predicted operon" /note 
= "ordered genes contained in the operon: b0325 " 

Operon complement(4592507..45933 1 3) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yjjM " 

+ Operon (3246594. .324801 6) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yqjC yqjD 
yqjE b3100 " 

Operon complement(2820162..2820662) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
oraA " 

Operon complement 199 8496.. 200 1629) /note = "predicted 
operon" /note = "ordered genes contained in the operon: fliZ 
fliA fliC " 

+ Operon (2543793. .2547426) /note == "predicted operon" 

/note = "ordered genes contained in the operon: b2428 b2429 
b2430 " 

Operon complement^ 1 17613.31 19295) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b2975 " 
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+ 



Operon complement(43 1 1389.. 4322743) /note = 
"documented phnCDE-b4 1 03-phnFGHIJKLMNOPQ 
operon" 

Operon complement(1734145.. 1735314) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
bl657 " 

Operon complement(225 5449.. 22573 16) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yeiN yeiC " 

Operon (3714927..3715913) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yiaE" 

Operon (801 1 10..802543) /note = "predicted operon" /note 
= "ordered genes contained in the operon: b0770 " 

Operon (841 91. .87848) /note = "documented leuO-ilvIH 
operon" 

Operon (214291. .215979) /note = "predicted operon" /note 
= "ordered genes contained in the operon: yaeQ yaeJ cutF " 
Operon (8S>634..1 03 153) /note = "predicted operon" /note = 
"ordered genes contained in the operon: yabB yabC ftsL ftsl 
murE murF mraY murD ftsW murG murC ddlB " 

Operon (3542470..3543201) /note - "predicted operon" 
/note = "ordered genes contained in the operon: yhgH " 

Operon complement(3334190..3334459) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yrbA " 

Operon (4233 811. .42373 09) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yjbF yjbG 
yjbH" 

Operon complement(578407..578859) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b0558 " 

Operon complement(4297143..4300516) /note = "predicted 
operon" /note = "ordered genes contained in the operon: yjcP 

yjcQ" 

Operon (797809..7988040 /note = "predicted operon" /note 
= "ordered genes contained in the operon: ybhE " 

Operon complement(1336594.. 13371 84) /note = "predicted 
operon" /note = "ordered genes contained in the operon: ribA 



Iux-a.pk086.h2 



lux-lacZ 



Operon complement(3390094..3393895) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yhdPyhdR" 

complement(360473..365529) 
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Operon complement(21 1 1456..21 12349) /note = "predicted 
operon" /note = "ordered genes contained in the operon: galF 

Tl 

Operon (4140109..4141523) /note - "predicted operon" 
/note = "ordered genes contained in the- operon: frwC frwB " 

Operon (2693959..2695377) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yfhD " 

Operon complement(3598659..3601874) /note = 
"documented ftsYEX operon" 

Operon (274525..276871) /note = "predicted operon" /note 
= "ordered genes contained in the operon: b0260 b0261 " 

Operon (4401 964.. 44021 61) /note = "predicted operon" 
/note = "ordered genes contained in the operon: b4176 " 

Operon complement(2904665.. 2905963) /note = "predicted 
operon" /note = "ordered genes contained in the operon: eno 

Operon (4152580..41 55802) /note = "documented argCBH 
operon" 

Operon complement^ 852741. .3 853934) /note = "predicted 
operon" /note = "ordered genes contained in the operon: yidF 
yidG yidH " 

Operon (621 523.-622773) /note = "predicted operon" /note 
= "ordered genes contained in the operon: ybdA " 

Operon (284619..287623) /note = "predicted operon" /note 
= "ordered genes contained in the operon: b0270 b0271 " 

Operon complement(20614 10.. 2063786) /note = 
"documented cobUST operon" 

Operon complement(3092119..3093144) /note « "predicted 
operon" /note = "ordered genes contained in the operon: 
b2950 " 

Operon complement 1952602.. 19561 56) /note = "predicted 
operon" /note = "ordered genes contained in the operon: bisZ 
bl873 " 

Operon complement(3767870..3769371) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yibH yibl 

Operon complement(2506481 ..2507446) /note = "predicted 

operon" /note = "ordered genes contained in the operon: glk 
ii 

Operon (4048927..4049436) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yihl " 
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Operon complement(783 1 05. .784046) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b0752 " 

Operon (17794 19.. 1782701) /note = "predicted operon" 
/note = "ordered genes contained in the operon: bl 699 bl 700 
ydiD " 

Operon complement^ 9782 12.. 1 98041 1) /note = 
"documented otsAB _ operon" 

Operon complement(32901 16..3290976) /note = "predicted 
operon" /note = "ordered genes contained in the operon: yraL 

Operon complement(1551996..1553720) /note « "predicted 
operon" /note = "ordered genes contained in the operon: sfcA 

Operon (2932257..2938121) /note = "documented 
fucPIKUR operon" 

Operon complement^ 17900.. 3 19252) /note « "predicted' 
operon" /note = "ordered genes contained in the operon: 
b0304 " 

Operon complement(3302214..3303458) /note = 
"documented mtr operon" 

Operon (87860..89032) /note = "predicted operon" /note = 
"ordered genes contained in the operon: fruL fruR " 

Operon complement(3553466..3555711) /note — "predicted 
operon" /note = "ordered genes contained in the operon: yhgj 
yhgKyhgL" 

Operon (4257900..4258885) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yjbL yjbM " 

Operon (594823..596196) /note = "predicted operon" /note 
= "ordered genes contained in the operon: b0572 " 

complement(3 60473 ..3 65529) 

Operon (794312..796835) /note = "documented modABC 
operon" 

Operon (1635056..1635481) /note = "predicted operon" 
/note = "ordered genes contained in the operon: bl549 " 

Operon (81 6267..8 18970) /note = "documented 
moaABCDE operon" 

Operon (1879936.. 1881021) /note = "predicted operon" 
/note = "ordered genes contained in the operon: bl800 " 

Operon (4213057..421 791 1) /note = "documented aceBAK 
operon" 

Operon (2232053. .2234520) /note = "predicted operon" 
/note = "ordered genes contained in the operon: b2146 yeiA " 
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+ Operon (2493599.-2494585) /note = "predicted operon" 
/note = "ordered genes contained in the operon: b2378 " 
Operon complement(2239830..2241672) /note = "predicted 
operon" /note = "ordered genes contained in the operon: yeiB 
folE n 

Operon complement(3748758..3749498) /note = "predicted 
operon" /note = "ordered genes contained in the operon: yiaT 



Operon complement(3 944752.. 3 945590) /note = "predicted 
operon" /note =' "ordered genes contained in the operon: yifA 
pssR " 

Operon (3500404..3502421) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yhfP yhfQ 
yhfR" 

Operon complement(l 577657.. 1580581) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b 1497 b 1498" 

Operon (2680877..2682076) /note = "predicted operon" 
/note = "ordered genes contained in the operon: b2550 " 

Operon complement(1234932.. 1236464) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
ycgB " 

Operon complement(4268628..4271450) /note = 
"documented uvr A operon" 

Operon (42485 3 4.. 4249 862) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yjbl M 

Operon complement(2245083..2246552) /note = "predicted 
operon" /note = "ordered genes contained in the operon: lysP 



Operon (236391 5..2371298) /note = "predicted operon" 
/note = "ordered genes contained in the operon: b2253 b2254 
b2255 b2256 b2257 b2258 " 

Operon complement 176295 8.. 1766709) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b!685bl686bl687" 

Operon (3676830..3677978) /note = "predicted operon" 
/note = "ordered genes contained in the operon: kdgK " 

Operon complement(2097884..2099290) /note = "predicted 
operon" /note = "ordered genes contained in the operon: gnd 

Operon complement(4228938..4229210) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yjbD " 
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Operon complement(280053.. 281207) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yagA " 

Operon complement(2769861 ..2770706) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b2638 b2639" 

Operon (3432844.343 5531) /note = "predicted operon" 
/note = "ordered genes contained in the operon: fmu trkA " 

Operon (441 1 83 8.. 44 134780 /note = "predicted operon" 
/note = "ordered genes contained in the operon: aidB " 

Operon (3483757.3484389) /note = "documented crp 
operon" 

Operon (3491648..3496838) /note = "documented nirBDC- 
cysG operon" 

Operon complement(l 232399.. 1233940) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
nhaB " 

Operon complement^ 694486.. 1695076) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
gusR " 

Operon (2837547..2840437) /note = "documented ascFB 
operon" 

Operon complement(605488.. 606606) /note = "predicted 
operon" /note - "ordered genes contained in the operon: 
b0581 " 

Operon (208821 4..2095247) /note = "documented 
hisGDCBHAFI operon" 

Operon complement(344890..345561) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b0328 " 

Operon (1808223..1808825) /note = "predicted operon" 
/note = "ordered genes contained in the operon: M728 " 

Operon complement(2474714..2475649) /note = 
"documented dsdC operon" 

Operon complement(1838807. .1839433) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
bl758 " 

Operon (2861616..2863035) /note = "predicted operon" 

/note = "ordered genes contained in the operon: b2738 b2739 
ii 

Operon complement(464836.. 466536) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
ybaE " 
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Operon (2775136..2775803) /note = "predicted operon" 
/note = "ordered genes contained in the operon: b2645 b2646 

ir 

Operon complement(820765.. 823720) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b0788 b0789 b0790 " 

Operon (1418389..1421668) /note = "predicted operon" 
/note = "ordered genes contained in the operon: M357 bI358 
bl359bl360bl361 bl362 " 

Operon complement 177001. .179153) /note — "predicted 
operon" /note = "ordered genes contained in the operon: 
yadS yadT pfs " 

Operon complement(2992482..2993 1 14) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b2855 b2856" 

Operon (2857783.-2858439) /note = "predicted operon" 
/note « "ordered genes contained in the operon: b2734 " 

Operon complement(4429669..4430643) /note = "predicted 
operon" /note = "ordered genes contained in the operon: ytfF 

Operon complement(3077663.. 3079654) /note = "predicted 
operon" /note = "ordered genes contained in the operon: tktA 

u 

Operon (3632372..3633523) /note = "predicted operon" /note 
- "ordered genes contained in the operon: yhiM " 
Operon (997713..1003880) /note = "predicted operon" /note 
= "ordered genes contained in the operon: b0939 b0940 
b0941 b0942 b0943 ycbF " 

Operon (4100373..4101077) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yiiM " 

Operon complement^ 90 877 8.. 290 9361) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
chpA chpR " 

Operon (3237584.3238828) /note = "predicted operon" 
/note = "ordered genes contained in the operon: ygjU " 

Operon complement(2083726..2085090) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yeeF " 

Operon (3841 591. .3843357) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yicP " 

Operon complement 1325791.. 1327 13 6) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
btuRyciK" 
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Operon complement^ 889349.. 1891259) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
bl808 " 

Operon complement(43 9426.. 4405 67) /note = "documented 
xseB-ispA operon" 

Operon complement(346081.. 347667) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b0330 n 

Operon (252005. .253 161) /note = "predicted operon" /note 
= "ordered genes contained in the operon: yafN yafO yafP " 

Operon complement(2591092..2594757) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b2473 b2474 b2475 " 

Operon complement(2420669..2421559) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b2305 " 

Operon (980270..982117) /note = "predicted operon" /note 
= "ordered genes contained in the operon: ycbB " 

Operon (774376..778255) /note = "documented tolQRAB 
operon" 

Operon complement(691097..692640) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b0659 b0660 " 

Operon complement(2 1 1 2524. .2 1 1 6426) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b2043 b2044 b2045 " 

Operon complement(4343258..4344904) /note = 
"documented fumB operon" 

Operon (341 6786..341 8859) /note = "predicted operon" 

/note = "ordered genes contained in the operon: yhdW yhdX 
ti 

Operon (384399..387870) /note « "predicted operon" /note 
= "ordered genes contained in the operon: b0365 b0366 
b0367 b0368 " 

Operon (4010643..4012904) /note = "documented metE 
operon" 

Operon (4349421. .434993 5) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yjdl yjdJ " 

Operon complement(2254105..2255355) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yeiM"- 

Operon complement 1 990897.. 1 992663) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
uvrC" 
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Operon complement(4452185..4453183) /note = "predicted 

operon" /note = "ordered genes contained in the operon: fbp 
ii 

Operon (2278652..2280412) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yejH " 

Operon complement(59687.. 63264) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yabO hepA " 

Operon (71351 . .72 115) /note = "predicted operon" /note = 
"ordered genes contained in the operon: yabl " 

Operon (2890237..2892794) /note = "predicted operon" 
/note = "ordered genes contained in the operon: b2765 b2766 
b2767 b2768" 

Operon (15445. .16557) /note = "predicted operon" /note = 
"ordered genes contained in the operon: yi81_l " 

Operon complement(45 90931.. 45 92292) /note = "predicted 
operon" /note = "ordered genes contained in the operon: yjiZ 

Operon (471822..473476) /note = "predicted operon" /note 
= "ordered genes contained in the operon: glnK amtB rr 

Operon complement(904963..906012) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b0868 " 

Operon (3208422..3212529) /note = "documented rpsU- 
dnaG-rpoD operon" 

Operon complement(l 844989.. 1 846032) /note = 
"documented selD operon" 

Operon (44 14530..44 15279) /note « "predicted operon" 
/note = "ordered genes contained in the operon: yjfP " 

Operon (33442 19..3345879) /note = "predicted operon" 
/note = "ordered genes contained in the operon: ptsN b3205 
ptsO" 

Operon complement(271551 1..2716548) /note = "predicted 
operon" /note = "ordered genes contained ih the operon: yfiF 

Operon (3637741 ..3638175) /note = "predicted operon" 
/note = "ordered genes contained in the operon; uspA " 
Operon (234798. .235538) /note = "predicted operon" /note 
= "ordered genes contained in the operon: b0213 " 
Operon complement(4308686..4309621) /note = "predicted 
operon" /note = "ordered genes contained in the operon: • 
yjcX " 
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Iux-a.pk0001.f2 
Iux-a.pk009.a4 

Iux-a.pk0018.g6 
Iux-a.pk0023.e3 
Iux-a.pk028.g6 

Iux-a.pk033.e6 ' 

Iux-a.pk038.a4 

Iux-a,pk042.d9 
Iux-a.pk048.g4 
Iux-a.pk053.d8 

Iux-a.pk060.e8 

Iux~a.pk067.f8 

Iux-a.pk073.f3 

Iux-a.pk079.g6 
Iux-a.pk087.g3 



lux- 

a.pk0002.M2 
Iux-a.pk009.a9 



Operon complement(2426077..2429677) /note = 
"documented dedD-cvpA-purF-ubiX operon" 

+ Operon (3579494.3 58 1811) /note = "predicted operon" /note 
= "ordered genes contained in the operon: yhhZ b3443 
insA_5 insB_5" 

+ Operon (2415080..2416621) /note = "predicted operon" 
/note = "ordered genes contained in the operon: b2298 " 

+ Operon (3223875.3225308) /note = "predicted operon" 
/note = "ordered genes contained in the operon: ygjl " 

Operon complement(3663810.3665210) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
gadA " 

Operon complement(3 825087.. 3 826292) /note = "predicted 

operon" /note = "ordered genes contained in the operon: gltS 
t» 

Operon complement^ 6233 10.3628232) /note = "predicted 
operon" /note = "ordered genes contained in the operon: yhhJ 
yhiGyhil" 

+ Operon (1 864932.. 1 866866) /note = "predicted operon" 
/note - "ordered genes contained in the operon: bl783 " 

+ Operon (30293 1 8.3030835) /note = "predicted operon" 
/note - "ordered genes contained in the operon: b2888 " 

Operon compIement(2523 9 50.. 2524876) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b2409" 

Operon complement(740298..741779) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b0709 " 

Operon complement(1921389..1922993) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
bl839M840bl841 " 

Operon complement(l 51 5906.. 151 6870) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
bl447bl448" 

Operon complement(98 8 377.. 989579) /note = "documented 
pncB operon" 

Operon complement(3475544..3476134) /note = "predicted 

operon" /note = "ordered genes contained in the operon: slyD 
ii 

+ Operon (484985.-485632) /note = "predicted operon" /note 
= "ordered genes contained in the operon: acrR " 
Operon complement(232597..233955)/note = "predicted 
operon" /note = "ordered genes contained in the operon: 
dniR" 
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Iux-a.pk0018.h2 + 
Iux-a.pk023.f2 

Iux-a.pk029.b2 + 

Iux-a.pk033.f3 + 
Iux-a.pk038.bl0 + 
Iux-a.pk042.el0 

Iux-a.pk049.a4 

Iux-a.pk053.e9 

lux-a.pk060.gl 

Iux-a.pk068.a3 
Iux-a.pk073.h2 + 
Iux-a.pk079.h9 

Iux-a.pk088.c3 

lux-lacZ 
Iux-a.pk009.b2 



lux- 

a.pk0019.al2 
Iux-a.pk0023.h6 



Operon (3782S87..3785724) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yibO yibP " 

Operon complement(2030406..2031524) /note = "predicted 
operon" /note = "ordered genes contained in the "operon: yedJ 
bl963 " 

Operon (3972208..3978309) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yifH yifl yifl 
rffT rffM " 

Operon (945094..947882) /note « "predicted operon" /note 
= "ordered genes contained in the operon: ycaD b0899 " 

Operon (1036963..1041 138) /note - "documented appCBA 
operon" 

Operon complement(3464797.. 3467490) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yheB " 

Operon complement(1946774.. 1948546) /note = "predicted 
operon" /note = "ordered genes contained in the operon: aspS 

Operon complement(2077555.. 2078613) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yeeA" 

Operon complement(1823979.. 1830006) /note - "predicted 
operon" /note = "ordered genes contained in the operon: 
bl744 M745 b!746 b!747 b!748 " 

Operon complement(926697..9301 85) /note = "documented 
cydCD operon" 

Operon (3646158.3648292) /note = "documented arsRBC 
operon" 

Operon complement^ 532989.. 1533882) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yddE " 

Operon complement(264844..266191) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b0250b0251 " 

complement(360473..365529) 

Operon complement(1397745..1402604) /note = "predicted 
operon" /note = "ordered genes contained in the operon: ogt 
ydaHbl337bl338 n 

Operon complement^ 065 3 60.. 3 066 100) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 

yggE" 

Operon complement^ 7974 1 7. . 1 8005.94) /note = 
"documented thrS-infC-rpml-rplT operon" 
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Iiix-a.pk029.b8 

Iux-a.pk033.f6 
Iux-a.pk038.b6 
lux-a.pk042.gl 
Iux-a.pk049.a8 
Iux-a.pk053.g7 . 

Iux-a.pk061.a5 

Iux-a.pk068.b3 

lux-a.pk074.cl 1 

lux-a.pk080.dl 

Iux-a.pk088.c3 

Iux-a.pk0002.b7 

lux-a.pk009.cll 

Iux-a.pk0019.a8 
Iux-a.pk0024.a3 
Iux-a.pk029.cl0 

Iux-a.pk033.gl0 



Operon complement(3487903..3489257) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
pabA fic yhfG " 

+ Operon (2380733..238 1944) /note = "predicted operon" 

/note = "ordered genes contained in the operon: b2269 " 
+ Operon (60 1 1 82..602558) /note = "predicted operon" /note 

= "ordered genes contained in the operon: pheP " 
+ Operon (2898614..2901 396) /note = "predicted operon" 

/note = "ordered genes contained in the operon: + ygcE " 
+ Operon (2556791 ..2558086) /note = "predicted operon" 

/note = "ordered genes contained in the operon: b2442 " 

Operon complement(2180055..2180801) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b2101 " 

Operon complement(3838176..3841494)/note = "predicted 
operon'Vnote -'ordered genes contained in the operon: yicM 
yicN yicO " 

Operon complement 1 1 4407. . 1 1 7549) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
bO 1 05 hofC hofB ppdD " 

+ Operon (3555900..3557498) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yhgB " 

+ Operon (209679..2 12266) /note = "predicted operon" /note 
= "ordered genes contained in the operon: ldcC b0187 " 
Operon complement(264844..2661 91 ) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b0250b0251 " 

Operon complement(273325.. 274341) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yi52_l " 

Operon complement(3080896..3081816) /note = "predicted 
operon'Vnote = "ordered genes contained in the operon: speB 

+ Operon (1 89642 1 1 898049) /note = "predicted operon" 
/note = "ordered genes contained in the operon: bl815 " 

■f- Operon (12443 83.. 1244823) /note = "predicted operon" 
/note = "ordered genes contained in the operon: bl 196 " 

Operon complement(l 550422.. 1 550784) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
bl477 " 

Operon complement^ 566978.. 15685 13) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
xasA " 
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Iux-a.pk038.c2 + 

lux-a.pk043.cl 
Iux-a.pk049.d8 

Iux-a.pk054.a6 

Iux~a.pk061.b6 

Iux-a.pk068.b8 + 

Iux-a.pk074.d9 • + 

Iux-a.pk080.el2 + 

Iux-a.pk088.d5 

lux-lacZ 
Iux-a.pk009.d6 

Iux-a.pk0019.a9 + 

Iux-a.pk0024.b6 + 

lux-a.pk029.cll + 

lux-a.pk033.gll + 

Iux-a.pk038.c7 + 

lux-a.pk043.ell 
Iux-a.pk049.g7 

Iux-a.pk054.cl0 



Operon (1599514.. 16053 13) /note = "predicted operon" 
/note = "ordered genes contained in the operon: bl513 bl 514 
bl515bl516b!517bl518" 

Operon complement(2929887..2931710) /note = 
"documented fucAO operon" 

Operon complement(5683..6459) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yaaA " 

Operon complement(4350778..4352295) /note = 
"documented lysU operon" 

Operon complement(181 5172..1 819643) /note = 
"documented celABCDF operon" 

Operon (1473162..1475474) /note = "predicted operon" 
/note = "ordered genes contained in the operon: ydbD " 

Operon (1903658..1904278) /note = "predicted operon" 
/note = "ordered genes contained in the operon: bl821 " 

Operon (243543..244121) /note = "predicted operon" /note 
= "ordered genes contained in the operon: gmhA " 

Operon complement^ 1 0503 8..3 1 07233) /note = 
"documented speC operon" 

compiement(3 60473 . .3 65529) 

Operon complement(642780..643 190) /note = "predicted 
operon" /note = "ordered genes contained in the operon: rnk 

Operon (3484440..3486530) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yhfiC " 
Operon (122092..129336) /note = "documented pdhR- 
aceEF-lpdA operon" 

Operon (4506526..4507122) /note = "predicted operon" 
/note = "ordered genes contained in the operon: b4285 " 

Operon (1289465.. 1290478) /note = "predicted operon" 
/note = "ordered genes contained in the operon: hnr " 

Operon (2609941. .26 12802) /note = "predicted operon" 
/note = "ordered genes contained in the operon: b2491 b2492 

Operon complement(2885601..2889921) /note = 
"documented cysJIH operon" 

Operon complement^ 886085.. 1 887770) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
fadD " 

Operon complement(33 17629..33 19272) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yhbX" 
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Iux-a.pk061.b7 
Iux-a.pk068.b9 
Iux-a.pk074.e3 

Iux~a.pk080.g3 
lux-a.pk088.el 
Iux-a.pk0002.e4 
lux-a.pk009.dll 



lux- 

a.pk0019.cll 
Iux-a.pk0024.c7 



Iux-a.pk029.c5 

Iux-a.pk033.g2 

Iux-a.pk038.d9 
lvix-a.pk043.fl 1 
Iux-a.pk049.h9 
Iux-a.pk054.c3 

Iux-a.pk061.c3 

Iux-a.pk068.h2 



Operon (180884.. 182308 /note = "predicted operon" /note = 
"ordered genes contained in the operon: htrA " 

Operon (467520.. 471 641 /note = "predicted operon" /note = 
"ordered genes contained in the operon: b0447 mdlA mdlB " 

Operon complement(4594719..4596971) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
mdoB " 

Operon (655780..656340) /note = "predicted operon" /note 
— "ordered genes contained in the operon: ybeG " 

Operon complement 1 932863.. 1934338) /note = 
"documented zwf operon" 

Operon (1735868.. 1736893) /note = "documented purR 
operon" 

Operon complement(4363050..4364351) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
dcuA " 

Operon (2621064..2623130) /note = "documented ppk 
operon" 

Operon complement^ 92 5 5 1 ..594666) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b0570 b0571 " 

Operon complement(266408.. 267244) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b0252" 

Operon complement(2 1 3 1 5 12..2 135265) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b2060b2061 b2062 " 

Operon complement(65855.. 70048) /note = "documented 
araBAD operon" 

Operon (4200898..4202223) /note = "predicted operon" 
/note = "ordered genes contained in the operon: hydG " 

Operon (3352267..3359575) /note = "documented gltBDF 
operon" 

Operon complement(2729620..2732193) /note = "predicted 
operon" /note = "ordered genes contained in the operon: clpB 



+ Operon (202560..208608) /note = "predicted operon" /note 
= "ordered genes contained in the operon: lpxA lpxB rnhB 
dnaE" 

+ Operon (1 03 155.. 106456) /note = "documented ftsQAZ 
operon" 



+ 
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lux-a.pk074.fl 

Iux-a.pk081.b5 

Iux-a.pk088.g2 

lux-lacZ- 
Iux-a.pk009.f3 



lux- 

a.pk0019.dl2 
lux-a.pk0024.el 



Iux-a.pk029.c5 
Iux-a.pk033.g7 
lux-a.pk038.ell 
Iux-a.pk043.f6 

lux-a.pk050.all 

lux-a.pk054.el 1 

Iux-a.pk061.d8 
Iux-a.pk068.h4 
Iux-a.pk074.g9 

Iux-a.pk081.b6 
Iux-a.pk088.h3 



+ Operon (2550372..2552144) /note = "predicted operon" 
/note = "ordered genes contained in the operon: amiA hemF 

+ Operon (3728760..3733786) /note - "documented 

xylFGHR operon" 
+ Operon (882896.. 884128) /note = "predicted operon" /note 

= "ordered genes contained in the operon: b0842 " 

complement(3 60473. .3 65 529) 

fl76; This 176 aa ORF is 45 pet identical (2 gaps) to 172 
residues of an approx. 1 84 aa protein FIMFJECOLI S W: 
P08189 

Operon complement(4061 1 82..4066856) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yihO yihO yihP yihQ " 

Operon complement(2854476..2854829) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
ygbA " 

+ Operon (2766686..2767507) /note = "predicted operon" 
/note = "ordered genes contained in the operon: b2633 " 

+ Operon (2001 895.. 20041 01) /note = "documented fliDST 
operon" 

+ Operon (3785S54..3786687) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yibQ 11 

Operon complement(3040509..3041 1 68) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b2899 " 

Operon complement(443 7449.. 443 8792) /note = "predicted 

operon" /note = "ordered genes contained in the operon: ytfL 
ii 

Operon complement(371 1690..3713909) /note = "predicted 
operon" /note = "ordered genes contained in the operon: bisC 

Operon complement^ 148833. .3 149999) /note = 
"documented exbDB operon" 
+ Operon (3 1805 66.. 31 81339) /note - "predicted operon" 
/note = "ordered genes contained in the operon: ygiE " 

Operon complement(4000900.. 4002326) /note = "predicted 
operon" /note = "ordered genes contained in the operon: rarD 
yigl " 

+ Operon (879950..881 152) /note = "predicted operon" /note 
= "ordered genes contained in the operon: dacC " 

+ Operon (122092.. 129336) /note = "documented pdhR- 
aceEF-lpdA operon" 
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Iux-a.pk0002.g5 
Iux-a.pk009.g3 

Iux-a.pk0019.e3 
Iux-a.pk0024.e6 
Iux-a.pk029.c9 

lux-a.pk034.all 

lux-a.pk038.fl 
Iux-a.pk043.g5 

Iux-a.pk050.al2 
Iux-a.pk054.e5 

Iux-a.pk061.e9 

Iux-a.pk069.a4 

Iux-a.pk074.hl0 
Iux-a.pk081.c2 

lux-a.pk089.al 1 

Iux-a.pk0003.a2 

Iux-a.pk009.g7 



+ Operon (2342885. .2346534) /note = "documented nrdAB 
operon" 

Operon complement(21 8 1736..2 183321) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b2103b2104" 

Operon complement(4012721..4013719) /note = "predicted 
operon" /note = "ordered genes contained in the operon: " 
+ Operon (893007..897152) /note = "documented potFGHI 
operon" 

+ Operon (89634.. 103 153) /note = "predicted operon" /note = 
"ordered genes contained in the operon: yabB yabC ftsL ftsl 
murE murF rnraY murD ftsW murG murC ddlB " 
Operon complement(2 1 1 6702..2 1 1 9576) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b2046 b2047" 

+ Operon (1 830452.. 1 83 1258) /note = "predicted operon" 
/note = "ordered genes contained in the operon: xthA " 
Operon complement(1957304..1957876) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
bl875 " 

Operon complement(4338298..4339206) /note = 
"documented melR operon" 

Operon complement^ 85 8976. .3861491 )/note ^"predicted 
operon'Vnote ="ordered genes contained in the operon: glvG 
glvBglvC" 

Operon complement(29805 1 9. .2982 1 46) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
kduDkdul" 

Operon complement (3 02 65 44.. 3 02 8 966) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b2886 b2887" 

+ Operon (41 09895..41 10335) /note = "predicted operon" 
/note = "ordered genes contained.in the operon: yiiR " 
Operon complement(4457474. .445793 8) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
nrdG" 

+ Operon (1 094746.. 1 096052) /note = "predicted operon" 

/note = "ordered genes contained in the operon: bl028 bl029 

+ Operon (349236..353816) /note = "predicted operon" /note 
= "ordered genes contained in the operon: b0333 b0334 
b0335 " 

FecR protein 
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Iux-a.pk0019.e4 
Iux-a.pk0024.f5 
Iux-a.pk029.e7 

Iux-a.pk034.b8 
Iux-a.pk038.f3 

lux-a.pk043.hl ' 
Iux-a.pk050.b9 
Iux-a.pk054.f7 
Iux-a.pk061.fl0 

Iux-a.pk069.a6 

Iux-a.pk075.al0 
Iux-a.pk081.e3 

Iux-a.pk089.c4 

lux-a.pk0003.cl 

lux- 

a.pk0010.a!2 
Iux-a.pk0019.f8 

Iux-a.pk0024.f7 
Iux-a.pk029.g2 



+ Operon (1303788..1304792) /note = "predicted operon" 
/note = "ordered genes contained in the operon: oppF " 

+ Operon (832293.-835433) /note = "predicted operon" /note 
= "ordered genes contained in the operon: dinG ybiB " 
Operon complement^ 852120. J 852878) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
bl770 " 

+ Operon (4126252..41 29847) /note = "documented metBL 
operon" 

Operon complement(303719..309250) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b0289 b0290 b0291 b0292 " 

+ Operon (2383874.-238485 1) /note = "predicted operon" 
/note = "ordered genes contained in the operon: b2271 " 

Operon complement^ 60473.. 365529) /note = "documented 
lacAYZ operon" 

+ Operon (1057307..1061621) /note « "documented torCAD 
operon" 

Operon complement(2612840..2613901) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b2493 " 

Operon complement(4372207..4373235) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yjeK " 

+ Operon (4609980..461 1053) /note - "predicted operon" 
/note = "ordered genes contained in the operon: yjjU " 
Operon complement(2024345..2026054) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
M956 " 

.+ Operon (434858..436331) /note = "predicted operon" /note 

= "ordered genes contained in the operon: b0417 b041 8 " 
+ Operon (1216509.. 121 8074) /note = "predicted operon" 

/note = "ordered genes contained in the operon: bl 168 " 
+ Operon (738224..740148) /note = "predicted operon" /note 

= "ordered genes contained in the operon: ybgA phrB " 
+ Operon (970975..971 868) /note = "predicted operon" /note 

= "ordered genes contained in the operon: b0919 " 

Operon complement(4555923.. 4558261) /note = "predicted 
operon" /note — "ordered genes contained in the operon: 
iadA yjiG yjiH " 

Operon complement^ 03 80.. 5 1222) /note = "predicted • 
operon" /note = "ordered genes contained in the operon: 
apaH " 
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Operon complement(3 843 403.. 3 844794) /note = 
"documented uhpT operon" 

Operon (22 1 2886. .22 1 36 1 7) /note = "predicted operon" 
/note= "ordered genes contained in the operon: yehV " 

Operon complement(3451 145. .3453035) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
pinO yheD " 

Operon complement(2759372..2763174) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b2627 b2628" 

Operon (1 95785. .200360)/note = "predicted operon" /note = 
"ordered genes contained in the operon: cdsA yaeL b0177 " 

Operon (4382971. .4383596) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yjeN yjeO " 

Operon complement(3673920..3675995) /note = "predicted 
operon" /note = "ordered genes contained in the operon: yhjF 

M 

Operon (21 92320.. 2 194353) /note = "predicted operon" 
/note = "ordered genes contained in the operon: metG " 

Operon (3031085..3031633) /note = "predicted operon" 
/note = "ordered genes contained in the operon: b2889 " 

Operon complement(4091029..4095029) /note = 
"documented rhaBAD operon" 

Operon complement(1246919.. 1250091) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
ycgCbll99bl200" 

Operon (1923464.. 1924806) /note = "predicted operon" 

/note = "ordered genes contained in the operon: bl843 bl844 
it 

Operon 20 10524.. 20 10802 /note = "predicted operon" /note 
= "ordered genes contained in the operon: b!936 " 

Operon complement(394354..39551 1) /note « "predicted 
operon" /note = "ordered genes contained in the operon: 
yaiH " 

Operon complement(41 12149..41 13159) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
glpX" 

Operon complement(45 19695.. 4523371) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yjhGyjhHyjhl" 

Operon complement(1654771..1655517) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
bl585 " 
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Iux-a.pk044.b7 
lux-a.pk050.el 

Iux-a.pk055.al2 

Iux-a.pk061.f5 
Iux-a.pk069.d3 
Iux~a.pk075.d9 
Iux-a.pk081.f3 

Iux-a.pk089.e2 



lux- 

a.pk0003.e!2 
Iux-a.pk0010.c7 



Iux-a.pk0019.g2 

Iux-a.pk0024.g3 

Iux-a.pk030.b3 

Iux-a.pk034.d3 

Iux-a.pk038.h6 

Iux-a.pk044.dl0 



Operon complement(4097072..4098106) /note = 
"documented rhaT operon" 

Operon complement(2906051.. 290768 8) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
pyrG " 

Operon complement^ 275497.. 3276306) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
agaR " 

Operon complement(39241 73 ..392463 1) /note = 
"documented asnC operon" 

Operon complement^ 125 6 5 8.. 4 125975) /note = 
"documented metJ operon" 

Operon 3220238..3223812 /note = "documented ebgAC 
operon" 

Operon complement(2570177..2573897) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
cchA eutl b2459 b2460 b2461 b2462 " 

Operon complement(2624715..2626958) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b2503 " 

Operon (4022578..4024818) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yigC ubiB " 

Operon complement(3750593..3752058)/note = "predicted 
operon'Vnote = "ordered genes contained in the operon: yiaV 
yiaW" 

Operon complement^ 654 5 5 6.. 265 74 8 7) /note = "predicted 
operon" /note = "ordered genes contained in the operon: yfhJ 
fdx hscA yfhE " 

Operon (2032043.. 2032777) /note = "predicted operon" 
/note = "ordered genes contained in the operon: bl964 bl965 

Operon complement(4272339..4272689) /note = "predicted 
operon" /note = "ordered genes contained in the operon: yjcB 

Operon complement(285849p..2859287) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b2735 " 

Operon complement(244191 L. 2446793) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b2325 b2326 yfcA mepA aroC yfcB » 

Operon complement(3668922..3669524) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yhjB " 
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Iux-a.pk050.el0 + 

Iux-a.pk055.a3 + 
Iux-a.pk061.h3 

Iux~a.pk069.d8 + 
lux-a.pk075.fll 

Iux-a.pk081.gl0 + 
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Iux-a.pk0010.d6 
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Operon (4439959..4445812) /note = "predicted operon" 
/note = "ordered genes contained in the operon: ytfM ytfN 
ytfP" 

Operon (250898..251953) /note = "predicted operon" /note 
= "ordered genes contained in the operon: " 

Operon complement^ 8 93 3 4.. 49003 6) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
ybaM priC " 

Operon (42423 5.. 429700) /note = "documented queA-tgt- 
yajC-secD-secF operon" 

Operon complement(2478658..2481359) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
emxY emxK " 

Operon (177361 1..1776371) /note = "predicted operon" 
/note = "ordered genes contained in the operon: ydiF bl695 11 

Operon complement(2563501.. 2570070) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b2451 eutH eutG eutJ eutE cchB " 

Operon complement 1252308.. 1255 175) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
M202 " 

Operon complement(2181736..2183321) /note = "predicted 
operon'Vnote = "ordered genes contained in the operon: 
b2103b2104" 

Operon complement(805221.. 806504) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
ybhC " 

Operon compiement(3463886..3464362) /note = "predicted 
operon" /note = "ordered genes contained in the operon: bfr " 

Operon (208621. .209580) /note = "documented accA 
operon" 

Operon (4501566..4503973) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yjhB yjhC " 

Operon complement(437539..439401) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b0420" 

Operon complement(4620670..4622358) /note = "predicted 
operon" /note = "ordered genes contained in the operon: IplA 
smp " 

Operon (1 958086.. 1 9598 19) /note = "predicted operon" 
/note = "ordered genes contained in the operon: argS 11 

Operon complement^ 99067.. 902957) /note = "documented 
artPIQMJ operon" 
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+ 



Operon complement(1431 108.. 143 1698) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
bl374 " 

Operon complement(3393963..3396015) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
cafAyhdE" 

Operon (47246..49631) /note - "predicted operon" /note = 
"ordered genes contained in the operon: yabF kefC " 

Operon (2852361. .2854439) /note = "predicted operon" 
/note = "ordered genes contained in the operon: fhlA " 

Operon (3208422..3212529) /note - "documented rpsU- 
dnaG-rpoD operon" 

Operon complement(2493070..2493312) /note - "predicted 
operon" /note = "ordered genes contained in the operon: 
b2377 " 

Operon complement(41 183 1..416176) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
sbcC sbcD " 

Operon (4105132..4106094) /note = "predicted operon" 
/note = "ordered genes contained in the operon: pflcA " 

Operon (2808791 ..2809321) /note = "predicted operon" 
/note = "ordered genes contained in the operon: emrR " 

Operon (2042885..2050036) /note = "predicted operon" 
/note = "ordered genes contained in the operon: bl978 " 

Operon (25 3 467.. 254202) /note = "predicted operon" /note 
= "ordered genes contained in the operon: b0235 prfH " 

Operon complement(4553059.;4553889) /note = "predicted 
operon" /note — "ordered genes contained in the operon: yjiC 



+ Operon (43394S9..4342368) /note = "documented melAB 
operon" 

Operon complement^ 10405.. 9 13043) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b0872 b0873 " 

Operon complement(930308..93 1273) /note = "predicted, 
operon" /note = "ordered genes contained in the operon: trxB 

Operon complement(25 1 3663 ..25 1 5969) /note = "predicted 
operon" /note — "ordered genes contained in the operon: 
yfeA" 

+ Operon (3866983..3868068) /note = "predicted operon" - 
/note = "ordered genes contained in the operon: yidS " 
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Operon complement(2447248..2453021) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b2332 b2333 b2334 b2335 b2336 b2337 b2338 " 

Operon complement(4577638..4580618) /note « 
"documented hsdMS operon" 

Operon complement(3 144871 ..3 145706) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b2999 b3000" 

complement(360473..365529) 

Operon complement(3261327..3264706) /note = 
"documented tdcABC operon" 

Operon (320832..323677) /note = "predicted operon" /note 
= "ordered genes contained in the operon: b0306 b0307 
b0308 " 

Operon (29901 16..2991492) /note - "predicted operon" 
/note = "ordered genes contained in the operon: b2852 " 

Operon complement(4067055..4067981) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yihR " 

Operon complement(1417789..1418265) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b!356 " 

Operon (547838. .550555) /note = "predicted operon" /note 
= "ordered genes contained in the operon: b0519 b0520 ybcF 

Operon (142 1806.. 1424004) /note = "predicted operon" 
/note = "ordered genes contained in the operon: trkG bl364 
b!365bl366 " 

Operon complement(2643033..2650307) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b2519b2520" 

Operon (179237.. 180754) /note = "predicted operon" /note 
= "ordered genes contained in the operon: dgt " 

Operon complement(602639..603886) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
ybdG" . 

Operon (432471 3.. 43 27 8 1 6) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yjdA yjcZ " 

Operon complement(2519613..2522898) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
xapR xapB xapA " 

Operon complement(2224529..2225290) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yohF" 
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Operon (3776681.-3778644) /note = "predicted operon" 
/note = "ordered genes contained in the operon: IctR IctD " 

Operon (1 848884.. 1 849900) /note = "documented ansA 
operon" 

Operon (3225442..3228880) /note = "predicted operon" 
/note = "ordered genes contained in the operon: ygjj ygjK " 

Operon complement(263 8706. .2640864) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
gcpE yfgA " 

Operon complement^ 1 328 87.J 1343 86) /note = "predicted 

operon" /note = "ordered genes contained in the operon: pitB 
ii 

Operon complement(1061773..1062998) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yccD cbpA " 

Operon (106557.. 107474) /note = "predicted operon" /note 
= "ordered genes contained in the operon: lpxC " 

Operon (339389..341731) /note = "predicted operon" /note 
= "ordered genes contained in the operon: b0323 b0324 " 

Operon (3535122. .3537344) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yhgF " 

Operon (3150251. .3151438) /note = "documented metC 
operon" 

Operon complement(3363337..3364353) /note = "predicted 
operon" /note = "ordered genes contained in tlie operon: 
yi52_l " 

Operon (4055987..4057762) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yihK " 
Operon (190..5020) /note = "documented thrLABC operon" 

Operon complement(858436..859251) /note =- "predicted 
operon" /note = "ordered genes contained in the operon: 
b0822 " 

Operon complement(1145234..1145857) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yceF " 

Operon (1312044.. 1312682) /note - "predicted operon" 
/note = "ordered genes contained in the operon: yciD ,T 
complement(360473 ..365529) 

Operon (539783..542257) /note - "predicted operon" /note = 
"ordered genes contained in the operon: b0513 b0514 " 
Operon complement(296605..301 797) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b0282 b0283 b0284 b0285 b0286 " 
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Operon complement(223 8648.. 223 9688) /note = 
"documented galS operon" 

Operon complement(4535227..4537078) /note = "predicted 
operon" /note = "ordered genes contained in the operon: yjhT 
yjhA " 

Operon complement(720279..723637) /note = "documented 
kdpDE operon" 

Operon complement^ 411555. .141 54 10) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
ydaC lar recT recE " 

Operon complement(30681 85. .307171 1) /note = "predicted 
operon" /note = "ordered genes contained in the operon: fba 
pgk epd " 

Operon complement(222745 8.. 222 8405) /note - "predicted 
operon" /note = "ordered genes contained in the operon: yohl 

II 

Operon complement(l 4471 00.. 1449373) /note - "predicted 
operon" /note - "ordered genes contained in the operon: 
tynA " 

Operon complement^ 171 520..3 175926) /note = "predicted 
operon" /note = "ordered genes contained in the operon: parE 
yqiA ice yqiB b3034 " 

Operon complement(39044 81 ..3909153) /note = 
"documented pstSCAB-phoU operon" 

Operon (2239L.27227) /note = "documented ileS-lspA- 
lytB operon" 

Operon (972760..980009) /note = "predicted operon" /note 
= "ordered genes contained in the operon: smtA mukF mukE 
mulcB " 

Operon complement(3043178..3043921) /note = "predicted 

operon" /note = "ordered genes contained in the operon: ygfF 
it 

Operon complement 14921 72.. 1493236) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
bl422 " 

Operon (4619338..4620669) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yjj J " 

Operon (5234.. 5530) /note = "predicted operon" /note = 
"ordered genes contained in the operon: b0005 " 

Operon complement(2409459..241 0632) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b2293 b2294" 
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Operon complement 1 0643.. 1 1356) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
bOOll" 

Operon (2 160898..2 163020) /note - "documented baeSR 
operon" 

Operon (4008666..4009565) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yigM " 

Operon complement(2699761.. 2702083) /note = 
"documented rnc-era-recO operon" 

Operon complement(550750..552323) /note = "documented 
purEK operon" 

Operon complement^ 877427.. 1877972) /note = "predicted 
operon" /note « "ordered genes contained in the operon: 
bl796bl797" 

Operon complement(4294798.. 4296945) /note = 
"documented fdhF operon" 

Operon complement(2223064..2223675) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yohC " 

Operon complement(3795866..3 805725) /note = 
"documented rfaQGPSBIJYZK operon" 

Operon complement(3396024..3398784) /note = 
"documented mreBCD operon" 

Operon (3776681. .3778644) /note = "predicted operon" /note 
= "ordered genes contained in the operon: IctR IctD " 
complement(3 60473 ..365 529) 

Operon complement(542485..545587) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b0515b0516b0517" 

Operon complement(3981965..3983620) /note = "predicted 
operon" /note = "ordered genes contained in the operon: aslA 

Operon complement 1274402.. 1276841) /note = 
"documented narXL operon" 

Operon (3234934..323S938) /note = "predicted operon" 
/note = "ordered genes contained in the operon: ygjR " 

Operon complement(747144..751401) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b0716b0717b0718" 

Operon (269466..270978) /note = "predicted operon" /note 
— "ordered genes contained in the operon: b0255 tra8_l " 
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Operon complement(3508698..3509763) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yhfYyhfZ" 

Operon complement(276980.. 279099) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yagC b0263 insBJ insA_2 " 

Operon complement 14971 5.. 1 52854) /note - "predicted 
operon" /note = "ordered genes contained in the operon: 
yadC yadK yadL yadM " 

+ Operon (632809..633969) /note = "predicted operon" /note 
= "ordered genes contained in the operon: b0600 " 

+ Operon (440773..442221) /note = "predicted operon" /note 
= "ordered genes contained in the operon: b0423 " 

+ Operon (319451. .320305) /note = "predicted operon" /note 
= "ordered genes contained in the operon: b0305 " 
Operon complement(3 868065. .3 869402) /note = "predicted 
operon" /note = "ordered genes contained in the operon: yidT 

+ Operon (1312044..13 12682) /note = "predicted operon" /note 
= "ordered genes contained in the operon: yciD " 

Operon complement(2034816..2036893) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b!968 b!969" 

+ Operon (703 87.. 7 1265) /note = "documented araC operon" 
+ Operon (770678..773404) /note = "documented cydAB 
operon" 

+ Operon (4569935..4571500) /note = "predicted operon" 
/note == "ordered genes contained in the operon: yjiT " 

+ Operon (4266993.. 4267706) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yjbP " 
Operon complement(3348330.. 3350660) /note = "predicted 
operon" /note == "ordered genes contained in the operon: arcB 



+ Operon (2923370..2924218) /note 
/note = "ordered genes contained in 

+ Operon (1 1 4367 1..1 144045) /note 
/note = "ordered genes contained in 

+ Operon (1475639.. 1480225) /note 
/note = "ordered genes contained in 
bl410bl411" 

+ Operon (3089126..3091 935) /note 
/note = "ordered genes contained in 
b2948 b2949 " 



= "predicted operon" 
the operon: b2794 " 

= "predicted operon" 
the operon: b!085 " 
== "predicted operon" 
the operon: M408 bl409 

= "predicted operon" 
the operon: yggj gshB 
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Iux-a.pk0020.g6 

lux-a.pk0025.fl 

Iux~a.pk031.a3 

Iux-a.pk035.c7 

Iux-a.pk039.h3 

Iux-a.pk045.f3 
Iux-a.pk050.h3 
Iux-a.pk055.h9 
Iux-a.pk063.d9 
Iux-a.pk070.d2 



Operon complement(3545619..3550106) /note == 
"documented malPQ operon" 

Operon complement 104 1253.. 1048555) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yccC b0982 b0983 b0984 b0985 b0986 " 

Operon complement(3 056686.. 3 057345) /note = "predicted 
operon" /note = "ordered genes contained in the operon: rpiA 

+ Operon (36981 92.. 3 699463) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yhj V " 
Operon complement^ 043 178..3043921) /note = "predicted' 

operon" /note = "ordered genes contained in the operon: ygfF 
ii 

Operon complement(2752029..2752785) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b2618b2619" 

- Operon (4202 10..421 583) /note = "predicted operon" /note = 
"ordered genes contained in the operon: b0402 " 

Operon compiement(5 87205.. 5 92401) /note = "documented 
nfrBA operon" 

+ Operon ( 1 892829. . 1 894772) /note = "predicted operon" 
/note = "ordered genes contained in the operon: pabB yeaB " 

+ Operon (340 523 8.. 3 4075 8 8) /note = "documented panF- 
prmA operon" 

Operon complement(3383856..3387041) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yhcP yhcQ b3242 " 

Operon complement^ 868278.. 2 870843) /note = "predicted 

operon" /note = "ordered genes contained in the operon: 

b2745 ygbB b2747 b2748 " 
+ Operon (14634 16.. 1465974) /note « "predicted operon" 

/note = "ordered genes contained in the operon: ydbA_l " 
+ Operon (4483786..4485968) /note = "predicted operon" 

/note = "ordered genes contained in the operon: yjgP yjgQ " 

+ Operon (784856..785908) /note = "documented aroG 
operon" 

+ Operon (7053 16..706980) /note = "predicted operon" /note 
= "ordered genes contained in the operon: glnS " 

Operon complement^ 593 97.. 862761) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b0823b0824" 
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Iux-a.pk076.f2 

Iux-a.pk082.d4 
Iux-a.pk005.b6 

Iux-a.pk0012.c3 

lux- 

a.pk0020.hll 
Iux-a.pk0025.f3 

lux-a.pk031.cl 1 

lux-a.pk035.gll 

Iux-a.pk039.h5 

lux-a.pk045.gl 

lux-a.pk051.bl 

lux-a.pk056.all 

lux-a.pk063.el 
Iux-a.pk070.e2 
Iux-a.pk076.f8 

Iux-a.pk082.d5 



+ Operon (1777641. .1779363) /note = "predicted operon" 

/note = "ordered genes contained in the operon: b!697 bl698 

rt 

+ Operon (4552145..455291 8) /note = "predicted operon" 
/note = "ordered genes contained in the operon: uxuR M 

+ Operon (2151891..2160901) /note » "predicted operon" 

/note = "ordered genes contained in the operon: b2074 b2075 
b2076 b2077" 

+ Operon (3478926..3482073) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yheS yheT 
yheU " 

Operon complement(2500010..2506262) /note = "predicted 
operon" /note — "ordered genes contained in the operon: 
b2383 b2384 b2385 b2386 b238? " 
+ Operon (2 194494. .220 193 1) /note = "predicted operon" 
/note = "ordered genes contained in the operon: molR molR 
molRyehl" 

Operon complement^ 146992..3 147486) /note = "predicted 
operon" /note = "ordered genes contained in the operon; 
b3002" 

+ Operon (3101031. .3102386) /note = "predicted operon" 

/note = "ordered genes contained in the operon: mutY b2962 
it 

Operon complement(2469097..2471266) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b2354 b2355 b2356 b2357 b2358 " 

Operon complement(4346893.. 4349240) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yjdGyjdH" 

Operon compiement(l 158585.. 1 160774) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
fhuE " 

+ Operon (571689..572956) /note = "predicted operon" /note 
= "ordered genes contained in the operon: b0547 b0548 
b0549 b0550 " 

Operon complement(l 181006..1 184817) /note = 
"documented potABCD operon" 
+ Operon (6241 08.. 62 8 5 20) /note = "documented entCEB A 
operon" 

Operon complement(3640010..3642812) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yhiQ prlC " 

Operon complement(2706774..2708032) /note = 
"documented rpoE-rseA operon" 
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Iux-a.pk005.el2 
Iux-a.pk0012.c7 

Iux-a.pk0020.h5 

Iux-a.pk0025.f5 
lux-a.pk03 l.c7 
Iux-a.pk035.h9 * 

lux-a.pk040.al 1 
Iux-a.pk045.g4 

lux-a.pk051.bll 

Iux-a.pk056.b3 

Iux-a.pk063.e8 
Iux-a.pk070.e4 
lux-a.pk077.al 1 
Iux-a.pk082.e8 

Iux-a.pk005.f6 

Iux-a.pk0012.e3 
lux- 

a.pk0021.al0 



+ 



Operon (4435285..44357S5) /note = "predicted operon" 
/note = "ordered genes contained in the operon: b4215 " 

Operon complement(34723 15..3474089) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yheL b3344 yheN b3346 " 

Operon complement(l 841 855.. 1844984) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
bl762topB " 

Operon (164730.. 167264) /note = "predicted operon" /note 
= "ordered genes contained in the operon: mrcB " 

Operon complement(303 1 677. .3 034226) /note = 
"documented prfB-lysS operon" 

Operon compiement(l 863750.. 1 864496) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
bl782 " 

Operon (1710793. .1712295) /note = "predicted operon" 
/note = "ordered genes contained in the operon: bl634 " 

Operon complement(2833196..2835448) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
hypF " 

Operon complement(3112567..3117128)/note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b2973 b2974" 

Operon complement(2948657.. 2956906) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
recD recB ptr" 

Operon (1807404.. 1808072) /note = "predicted operon" 
/note = "ordered genes contained in the operon: bl727 " 

Operon (2589267..2590754) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yffB dapE " 

Operon complement(608682..61 1717) /note = "documented 
fepA-entD operon" 

Operon ( 4028751. .4032033) /note - "predicted operon" 
/note = "ordered genes contained in the operon: pepQ yigZ 
trkH" 

Operon (3602024..3602879) /note = "predicted operon" 

/note = "ordered genes contained in the operon: yhhF b3466 
it 

Operon (3540803. .3541681) /note = "predicted operon" /note 
= "ordered genes contained in the operon: yhgA " 

Operon (3285731. .3290073) /note = "predicted operon" . 
/note = "ordered genes contained in the operon: yral yraJ 
yraK" 
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lux- 

a.pk0025.gll 

lux-a.pk031.dl 

Iux-a.pk036.a5 

Iux-a.pk040.b5 

Iux-a.pk046.a5 

lux-a.pk051.cll 

Iux-a.pk056.b4 

Iux-a.pk063.g7 

lux-a.pk070.fll 

Iux-a.pk077.a3 

Iux-a.pk082.g4 

Iux-a.pk005.gl0 

lux- 

a.pk0012.fll 
Iux-a.pk0021.b6 

Iux-a.pk0025.g5 

Iux-a.pk031.e7 

Iux-a.pk036.b6 

Iux-a.pk040.b7 



Operon complement(4159749..4I60849) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
trmA " 

Operon 'complement 13 96798.. 13 975 50) /note = 
"documented fhr operon" 

Operon complement(4586446..4588864) /note = "predicted 
operon" /note = "ordered genes contained in the operon: yjiX 
yjiY" 

Operon complement(3913 181 ..3920080) /note = 
"documented atpIBEFHAGDC operon" 

Operon (1846861.. 1848717) /note = "predicted operon" 
/note = "ordered genes contained in the operon: sppA " 

Operon complement 1 607253 .. 1 608704) /note = 
"documented uxaB operon" 

Operon complement(l 588878.. 1590466) /note = 
"documented hipBA operon" 

Operon (1741481..1742854) /note = "predicted operon" 
/note = "ordered genes contained in the operon: ydhE " 

Operon (3834580..3835764) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yicK " 

Operon complement(3130469..3131972) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b2984b2985" 

Operon (1 08279.. 1 10984) /note = "predicted operon" /note 
= "ordered genes contained in the operon: secA " 

Operon (1486256.. 1487695) /note - "predicted operon" 
/note = "ordered genes contained in the operon: aldA " 

Operon (45807..47138) /note = "predicted operon" /note = 
"ordered genes contained in the operon: yaaU " 

Operon complement(3427403.. 3429566) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yrdB aroE yrdC yrdD " 

Operon (738224..740148) /note = "predicted operon" /note 
= "ordered genes contained in the operon: ybgA phrB" 
Operon (27293.. 28207) /note = "predicted operon" /note = 
"ordered genes contained in the operon: yaaF " 

Operon complement(4428899.. 4429561) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b4209 " 

Operon complement(l 582231 ..1584510) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
bl501" 
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Iux-a.pk046.b5 
Iux-a.pk051.dl0 

Iux-a.pk056.c3 

Iux-a.pk064.cl0 

Iux-a.pk070.g5 
Iux-a.pk077.a9 

Iux-a.pk082.h2 
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Iux-a.pk0021.b8 
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Iux-a.pk040.c9 

lux-a.pk046.cl 

Iux-a.pk051.d5 
Iux-a.pk056.d6 

Iux-a.pk064.c7 



+ 



+ 



Operon complement(3044188..3048687) /note = 
"documented gcvTHP operon" 

Operon complement^ 15 69 69.. 4 158303) /note = "predicted 
operon" /note - "ordered genes contained in the operon:* 
udhA " 

Operon complement^ 3 2 72 5..3 3 3 657) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b0316" 

Operon complement(2945779..2947122) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b2817 " 

Operon (4328080..4329582) /note - "documented proP 
operon" 

Operon complement(229967..230881) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yafC " 

Operon (1753722.. 1755 134) /note = "predicted operon" 
/note = "ordered genes contained in the operon: pykF " 

Operon (40496 19..4050998) /note = "predicted operon" 
/note = "ordered genes contained in the operon: hemN " 

Operon (1218824..1220344) /note = "predicted operon" 
/note « "ordered genes contained in the operon: bl 169 " 

Operon (3 928943. .3930502) /note = "predicted operon" 
/note = "ordered genes contained in the operon: kup " 

Operon (1785469.. 1786302) /note = "predicted operon" 
/note = "ordered genes contained in the operon: ydiA " 

Operon (533140..535710) /note = "predicted operon" /note 
= "ordered genes contained in the operon: gel gip " 

Operon (93 8651. .93 9943) /note = "predicted operon" /note 
= "ordered genes contained in the operon: serS " 

Operon complement^ 8 3903..586 131) /note = "documented 
envY-ompT operon" 

Operon (535810..53831 1) /note = "predicted operon" /note 
= "ordered genes contained in the operon: b0509 b0510 
b0511" 

Operon (638168..640541) /note = "documented ahpCF 
operon" 

Operon (1360767.. 1364839) /note = "predicted operon" 
/note = "ordered genes contained in the operon: aldH ordL 
goaG " 

Operon complement(l 85301 5.. 1859356) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
bl771 M772 b!773 bi774 b!775 b!776 " 
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Operon complement(408332..409243) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yaiD " 

Operon compIement(75 1452.. 75201 8) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
ybgD " 

Operon complement^ 05 827.. 5 063 06) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b0481 " 

complement^ 60473 ..3 65529) 

+ Operon (819107.. 81981 1) /note = "predicted operon" /note 
= "ordered genes contained in the operon: b0786 " 

Operon compiement(l 244902.. 1 246599) /note = "predicted 
operon" /note = "ordered genes contained in the operon: treA 

Operon complement(764376..765098) /note = "predicted 

operon" /note = "ordered genes contained in the operon: farR 
tt 

+ Operon (2261883. .2263064) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yeiO " 

+ r Operon (41 56069..41 56986) /note - "documented oxyR 
operon" 

Operon complement(267 1 836..26773 87) /note = "predicted 
operon" /note — "ordered genes contained in the operon: 
b2544 b2545 b2546 b2547 b2548 " 

+ Operon (2282 1 49..2284 1 56) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yejL yejM " 
Operon complement 1 722760. . 1 724082) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
bl647b!648" 

+ Operon (1 63 1 646.. 1632236) /note = "predicted operon" 
/note = "ordered genes contained in the operon: bl545 " 

+ Operon (3274643.3275442) /note « "predicted operon" 

/note = "ordered genes contained in the operon: sohA yhaV " 
Operon complement(906075..910272) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b0869 b0870 poxB " 

+ Operon (3884457..3885821) /note = "predicted operon" 
/note = "ordered genes contained in the operon: thdF " 

Operon complement(2436962..2438140) /note = "predicted 
operon" /note = "ordered genes contained in the operon: . 
b2322 " 
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Operon complement(841019..841423) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b0806 " 

Operon complement(4333272,.4334609) /note = "predicted 
operon" /note = "ordered genes contained in the operon; 

yjdD" 

Operon (4133655..4134593) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yijE " 

Operon (4448633..4452152) /note = "predicted operon" 
/note = "ordered genes contained in the operon: ytfR ytfS 
ytfT yjfF " 

Operon (3735 126.. 373 7 156) /note = "documented malS 
operon" 

Operon complement(837753.. 840754) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b0804 b0805 " 

Operon (3791614..3795834) /note = "documented rfaDFCL 
operon" 

Operon (2765725..2766594) /note = "predicted operon" 
/note - "ordered genes contained in the operon: b2632 " 

Operon complement(2943058..2943864) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b2812 " 

Operon complement (7 1748 5.. 71 9683) /note = "documented 
speF operon" 

Operon (3058870..3064300) /note == "predicted operon" 
/note = "ordered genes contained in the operon: sbm ygfD 
b2919b2920" 

Operon complement(405 1449..4055614) /note = 
"documented glnALG operon" 

Operon (34 16027..34 16248) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yhdV " 

Operon complement^ 120784..1 121830) /note = 
"documented pyrC operon" 

complement(360473..365529) 

Operon (2848670..2852287) /note = "documented 
hypABCDE operon" 

Operon (331595..332683) /note = "predicted operon" /note 
= "ordered genes contained in the operon: yahA " 

Operon (3229306..323 1 324) /note = "predicted operon" 
/note = "ordered genes contained in the operon: ygjL " 

Operon (1097070.. 1098047) /note = "predicted operon" 
/note = "ordered genes contained in the operon: b!033 " . 
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Operon complement(1665368.. 1666588) /note = "predicted 

operon" /note = "ordered genes contained in the operon: mlc 
ii 

Operon (4422696..4424135) /note = "documented rpsF- 
priB-rpsR-rplI operon" 

Operon complement(4085688..4090404) /note « "predicted 
operon" /note = "ordered genes. contained in the operon: frvR 
frvX frvB firvA " 

Operon complement(476291.. 477847) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b0457 " 

Operon (1725861..1726268) /note = "predicted operon" 
/note = "ordered genes contained in the operon: b!651 " 

Operon complement(2225343.. 2226859) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yohG yohH " 

Operon (3244277..3245068) /note = "predicted operon" 
/note = "ordered genes contained in the operon: exuR " 

Operon (3291 041. .3294625) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yraM yraN 
yraO yraP " 

Operon (1532048.. 1532893) /note = "predicted operon" 
/note = "ordered genes contained in the operon: bl463 " 

Operon complement(24 17861 ..2418505) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b2301 " 

Operon (8238..9191) /note = "predicted operon" /note = 
"ordered genes contained in the operon: talB " 

Operon (1944 176.. 1944877) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yebB " 

Operon complement(3875333..3877747) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
gyrB - 

Operon complement(1020953..1023571) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b0960yccF" . 

Operon complement(3250958.325 1854) /note = "predicted 

operon" /note = "ordered genes contained in the operon: yhaJ 
ii 

Operon (1321244.. 1324665) /note = "predicted operon" 
/note = "ordered genes contained in the operon: bl266 yciO 
yciQ " 
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Operon complement^ 84257.. 1 88650) /note = "predicted 
operon" /note = "ordered genes contained in the operon: yael 
b0165dapDglnD" 

Operon complement(4280832..4282792) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yjcGyjcH" 

Operon complement(445 8096.. 4460234) /note = "predicted 
operon" /note « "ordered genes contained in the operon: 
nrdD M 

Operon complement(2748136..2748729) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
grpE" 

Operon complement(2547666..2548592) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b2431 " 

Operon (4628275..4630239) /note = "predicted operon" 
/note = "ordered genes contained in the operon: sit " 

Operon complement 1 1 6974 1 1 1 73 1 87) /note = "predicted 
operon" /note = "ordered genes contained in the operon: mfd 

complement^ 60473.. 365529) 

Operon (142779.. 144472) /note = "predicted operon" /note 
= "ordered genes contained in the operon: yadG yadH " 

Operon complement(4612249..46 14634) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yjjWyjjl" 

Operon (23 18063.. 232 1271) /note = "predicted operon" 
/note = "ordered genes contained in the operon: atoS atoC " 

Operon complement^ 1 293 5 6..3 130333) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b2983 " 

Operon complement(3836802..3 837620) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
nlpA " 

Operon complement^ 927224.. 3 928744) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yieN" 

Operon (401 4920.. 40 16347) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yigN " 

Operon (3995596.3997758) /note - "documented uvrD 
operon" 

Operon complement(947883..948791) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b0900" 
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Iux-a.pk064.h8 

Iux-a.pk071.g2 

Iux-a.pk077.e4 

lux-a.pk085.al 1 
Iux-a.pk006.b4 

lux- 

a.pk0017.cl0 

Iux-a.pk0021.f6 

Iux-a.pk0026.c8 

Iux-a.pk031.g9 
Iux-a.pk036.e8 

Iux-a.pk040.g4 
Iux-a.pk046.g7 

Iux-a.pk052.b7 

Iux-a.pk058.a9 
Iux-a.pk065.d9 

Iux-a.pk071.g4 



Operon complement(1550852..1551892) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
bl478 " 

Operon complement(4585479. .4586333) /note = "predicted 

operon" /note = "ordered genes contained in the operon: yjiA 
»t 

Operon complement 1 59664 L. 1 598233) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
bl511 " 

+ Operon (2435970..2436965) /note = "predicted operon" 
/note = "ordered genes contained in the operon: div " 

Operon complement(1313880.. 13141 16) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yciG " 

Operon complement(441 3595. .4413897) /note = "predicted 

operon" /note = "ordered genes contained in the operon: yjfN 
»t 

Operon complement(3215197..3216717) /note = "predicted 
operon" /note = "ordered genes contained in the operon: air " 
+ Operon (249669 1 . .2500007) /note - "predicted operon" 

/note = "ordered genes contained in the operon: b2380 b2381 
b2382 " 

+ Operon (358023..360370) /note = "documented cynTSX 
operon" 

Operon complement (245 5 03 5.. 245 848 9) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b2341 b2342" 

Operon complement(3137731..3143 155) /note = 
"documented hybGFEDCBA operon" 

Operon complement(1355826..1357265) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
bl296" 

Operon complement(l 884888.. 1 88601 5) /note = "predicted 

operon" /note = "ordered genes contained in the operon: rnd 
ii 

+ Operon (3963322..3963705) /note = "predicted operon" 
/note = "ordered genes contained in the operon: trxA " 

Operon complement^ 878910.. 1879854) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
bl799 " 

Operon complement^ 160760..3 161497) /note = "predicted . 
operon" /note = "ordered genes contained in the operon: plsC 
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Iux-a.pk077.fl0 

Iux-a.pk085.a5 

Iux-a.pk006.c9 

Iux-a.pk0017.c7 

Iux- 

a.pk0021.gll 
lux-a.pk0026.dl 

Iux-a.pk032.bl0 

Iux-a.pk036.h7 
Iux-a.pk040.g6 

Iux-a.pk047.al0 
lux-a.pk052.cll 
lux-a.pk058.cl 

Iux-a.pk065.e4 
Iux-a.pk071.g6 

Iux-a.pk077.fl2 
lux-a.pk085.cl 

Iux-a.pk006.el0 
Iux-a.pk0017.c9 
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Operon (2991 660..2991 878) /note = "predicted operon" 
/note = "ordered genes contained in the operon: b2853 " 

Operon (4254694..4256700) /note = "documented lexA- 
dinF operon" 

Operon (4381375..4382919) /note = "predicted operon" 
/note — "ordered genes contained in the operon: yjeM " 

Operon (504138..505790) /note = "predicted operon" /note 
= "ordered genes contained in the operon: ushA " 

Operon (4581 12..460466) /note = "predicted operon" /note 
= "ordered genes contained in the operon: Ion " 

Operon (1 1 3444.. 1 1 4487) /note = "predicted operon" /note 
= "ordered genes contained in the operon: guaC " 

Operon (1262937.. 1268242) /note = "predicted operon" 
/note = "ordered genes contained in the operon: hemA prfA 
hemKbl213ychAlcdsA" 

Operon (182445.. 183620) /note = "predicted operon" /note 
= "ordered genes contained in the operon: yaeG " 

Operon complement(2434735.. 2435871) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
pdxB " 

Operon (201 1251. .2017535) /note = "documented 
fliFGHIJK operon" 

Operon (1 124785..1 126952) /note = "predicted operon" 
/note = "ordered genes contained in the operon: rimJ yceH " 

Operon complement(325 3729.325 5 597) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yhaN yhaO " 

Operon (3272923.3274494) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yhaG " 

Operon complement(4454888..4455439) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 

yjgA " 

Operon complement(2841059..2848458) /note = 
"documented hycABCDEFGH operon" 

Operon complement(944 154.. 944780) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
ycaC " 

Operon complement(2122545..2131419) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yefD yefC yefB yefA b2054 b2055 b2056 b2057 wcaB 
b2059 " 

Operon (3958292.3960313) /note = "predicted operon" 
/note = "ordered genes contained in the operon: rep 11 
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Iux-a.pk0022.a2 
lux- 

a.pk0026.el0 
Iux-a.pk032.b2 



Iux-a.pk036.h9 
Iux-a.pk041.al0 

lux-a.pk047.al 1 

Iux-a.pk052.c5 

lux-a.pk058.cl 1 
lux-a.pk065.fl 1 

Iux-a.pk072.al0 

Iux-a.pk077.f3 

lux-a.pk085.fl 

Iux-a.pk006.f7 

Iux-a.pk0017.d3 
lux-a.pk0022.bl 
lux- 

a.pk0026.fll 
Iux-a.pk032.c3 



Operon compIe'ment(2536692..2541 548) /note = 
"documented cysPUWAM operon" 

Operon (3610600.361 1 1 87) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yhhU " 

Operon (328181 1.. 3284666) /note = "predicted operon" 
/note = "ordered genes contained in the operon: agaB agaC 
agaD agal " 

Operon (83 0095.. 83 1459) /note = "predicted operon" /note 
= "ordered genes contained in the operon: rhlE " 

Operon complement(3780269..3781755) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
gpsA secB " 

Operon (4016442..4021926) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yigO yigP 
yigQ yigTa yigTb yigTc yigU yigW yigW " 

Operon complement(25741 1 8..2S76397) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b2463 " 

Operon complement 1980578.. 19841 51) /note = 
"documented araFG-M899-araH operon" 

Operon complement(3376505.. 3 377632) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yhcM " 

Operon complement 14971 5.. 152854) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yadC yadK yadL yadM " 

Operon (396553L.3972101) /note = "predicted operon" 
/note = "ordered genes contained in the operon: rfe b3785 
rffE ifiD rffG rffH " 

Operon complement(54755..571 09) /note = "predicted 

operon" /note = "ordered genes contained in the operon: imp 
ii 

i 

Operon complement^ 865689.. 3 866939) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yidR " 

Operon (4476036..4476452) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yjgD " 

Operon complement(4025199..4028561) /note = 
"documented fadBA operon" 

Operon complement(78848..83708) /note = "documented 
leuLABCD operon" 

Operon complement(2744454..27458 1 5) /note = "predicted 
operon" /note = "ordered genes contained in the operon: ffh " 
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Iux-a.pk037.a3 
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Operon (3857880..3858803) /note = "predicted operon" 
/note = "ordered genes contained in the operon: b3680 M 

Operon complement(640662..641090) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b0607 " 

Operon complement(661 975..663 186) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
dacA " 

Operon complement^ 892901.. 3 8 9423 8) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yieG" 

Operon complement(l 1 87539.. 1 1 89670) /note = 
"documented phoQP operon" 

Operon (2135858..2137507) /note = "predicted operon" 
/note = "ordered genes contained in the operon: b2063 " 

Operon complement(l 8461 49.. 1846700) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
ydjA " 

Operon complement(4534182..4535162) /note = "predicted 
operon" /note = "ordered genes contained in the operon: yjhS 

Operon complement(101 5762.. 1017522) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b0955 " 

Operon (989845.. 992457) /note = "predicted operon" /note 
= "ordered genes contained in the operon: pepN " 

Operon complement(2334813..2337440) /note = 
"documented gyrA operon" 

Operon complement(4188313.. 4193677) /note = 
"documented thiCEFGH operon" 

Operon (4276058..4277407) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yjcD " 

Operon (246 1032.. 2462090) /note - "predicted operon" 
/note = "ordered genes contained in the operon: b2345 " 

Operon complement(824225.. 829878) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b0792 b0793 b0794 b0795 ybiH " 

Operon complement(3 334604.. 3 337706) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yrbB yrbC b3193 yrbE b3195 " 

Operon (2735619..2736925) /note = "documented pheLA 
operon" 
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Operon (1732459.. 1733274) /note - "predicted . 
/note = "ordered genes contained in the operon: b 



x y a i ^ ..7 , . .i / _/ : .j / -t j i n kj iw — p.*. ^ua\-L^d operon" 
/note = "ordered genes contained in the operon: I 
Operon (1928905. .1930083) /note = "predicted 
/note = "ordered genes contained in the operon: purT " 

Operon compiement(2246757..2247638) /note = "predicted 

operon" /note = "ordered genes contained in the operon: yeiE 
it 

+ Operon (538371 ..539732) /note = "predicted operon" /note 
= "ordered genes contained in the operon: b0512 " 

Operon complement(383283..383693) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b0364" 

+ Operon (532235.. 533050) /note = "predicted operon" /note 
= "ordered genes contained in the operon: b0506 " 

Operon complement(259585 1 ..2597780) /note = 
"documented dapA-nlpB operon" 

+ Operon (3 656862. .366 1 1 57) /note = "predicted operon" 
/note = "ordered genes contained in the operon: yhiU yhiV " 

Operon complement(4148026..4150677) /note - "predicted 
operon" /note = "ordered genes contained in the operon: ppc 

+ Operon (940269..9441 1 9) /note = "documented dmsABC 
operon" 

+ Operon (432226..434780) /note == "predicted operon" /note 
= "ordered genes contained in the operon: ybaD ribD ribH 
nusB " 

Operon complement(658474. .661435) /note = "predicted 
operon" /note = "ordered genes contained in the operon: lip A 
ybeFlipB" 

+ Operon (1737935. .1739146) /note = "predicted operon" 
/note = "ordered genes contained in the operon: ydhC " 
Operon complement(3322642. .3325305) /note = 
"documented ftsJ-hflB operon" 

Operon complement(2712459..271 3385) /note = "predicted 
operon" /note = "ordered genes contained in the operon: yfiE 

Operon complement^ 942370.. 1944000) /note = 
"documented ruvBA operon" 

+ Operon (3 155664..3 156593) /note = "predicted operon" . 
/note = "ordered genes contained in the operon: + " ' 

+ Operon (301 8561 ..3022207) /note = "predicted operon" • 
/note = "ordered genes contained in the operon: b2880 b2881 
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Iux-a.pk078.c2 
Iux-a.pk086.a5 

Iux-a.pk007.al0 

lux- 

a.pk0017.fll 
lux-a.pk0022.cl 

Iux-a.pk0026.g2 

Iux-a.pk032.e4 
Iux-a.pk037.c8 

Iux-a.pk041.d8 
Iux-a.pk047.c6 
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Iux-a.pk058.f3 

Iux-a.pk065.h2 

lux-a.pk072.dl 

Iux-a.pk078.c2 
Iux-a.pk086.b7 



Operon complement(288525.. 289529) /note » "documented 
argF operon" 

Operon complement(4302191..4304188) /note = "predicted 
operon" /note = "ordered genes contained in the operon: yjcS 

Operon complement^ 3 3 843 7.. 23 421 89) /note = "predicted 

operon" /note = "ordered genes contained in the operon: yfaL 
ii 

Operon complement^ 633 61 9..263541 5) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b2510b2511" 

+ Operon (334504..339313) /note = "predicted operon" /note 
— "ordered genes contained in the operon: b0318 b0319 
b0320 b0321 b0322 " 

Operon complement(786066.. 78681 8) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
gpmA" 

+ Operon (2141288..2144605) /note = "predicted operon" 
/note — "ordered genes contained in the operon: yegE " 
Operon complement (262 5 5 2.. 2 63 23 1) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
b0245 b0246 " 

Operon complement(622777..623733) /note = "documented 
fepB operon" 

+ Operon (3154754.3155464) /note = "predicted operon" 
/note = "ordered genes contained in the operon: b3012 " 

Operon complement^ 665421 ..366681 8) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
yhjA » 

Operon complement(3371333..3372124) /note = "predicted 
operon" /note — "ordered genes contained in the operon: 
b3226 " 

Operon complement(2009370..2009891) /note = "predicted 
operon" /note = "ordered genes contained in the operon: 
M933bl934" 

Operon complement(4300657..4301688) /note = "predicted 
operon" /note = "ordered genes contained in the operon: yjcR 

Operon complement(4474870..4475874) /note = 
"documented argl operon" 

+ Operon (1481 142..1484987) /note = "predicted operon" . 
/note = "ordered genes contained in the operon: hrpA " 



107 



WO 01/79419 



PCTAJS01/11855 



The numbers in parenthesis represent each end of the nucleotide sequence designation in the E. coli 
genomic sequence. The sequence designation is based on Blattner et al. ((1997) Science 277:1453- 
1462) 

Table 19 

lux-z.pk013.al 5 - Operon complement(3366649..3370239) note = "predicted 

operon" note ="ordered genes contained in the operon: b3221 
yhcl b3223 nanT" 

Iux-z.pk013.gl7 - Operon complement(4488774..4490093) note = "predicted 

operon" note — 'ordered genes contained in the operon: yjgT" 

Iux-z.pk013.i9 + Operon 3 128 193.. 3 129209 note = "predicted operon" note 

="ordered genes contained in the operon: yi52_9" 

lux-z.pk013.kl - Operon complement(2978786..2980204) note = "documented 

araE operon" 

lux-z.pk013.kll + Operon 2027561. .2028481 note == "predicted operon" note 

="ordered genes contained in the operon: yedA" 

Iux-z.pk013.k5 + Operon 2418641 ..2419288 note = "predicted operon" note 

— 'ordered genes contained in the operon: b2302" 

Iux-z.pk013.o5 - Operon complement^ 347004.. 1348209) note = "predicted 

operon" note =" ordered genes contained in the operon: 
bl287" 

Iux-z.pk014.al4 + Operon 1063259.. 10645 15 note = "predicted operon" note 

="ordered genes contained in the operon: yccE" 

Iux-z.pk014.a22 + Operon 557402..557977 note = "predicted operon" note 

-'ordered genes contained in the operon: b0530" 

Iux-z.pk014.cl6 + Operon 607288..608400 note = "predicted operon" note 

— ' ordered genes contained in the operon: yi812" 

Iux-z.pk014.c5 - Operon complement^ 561 358.. 1565 164) note = "predicted 

operon" note ="ordered genes contained in the operon: bl489 
bl490" 

lux-z.pk014.gl 1 - Operon complement(3055 1 98..3056430) note = "documented 

serA operon" 

Iux-z.pk014.h9 + Operon 948891. .949481 note = "predicted operon" note 

="ordered genes contained in the operon: ycaK" 

Iux-z.pk014.kl3 + Operon 3769908..3773786 note = "documented mtlADR 

operon" 

lux-z.pk014.11 + Operon 3 132146..3 132838 note = "predicted operon" note 

—"ordered genes contained in the operon: b2986" 

lux-z.pk014.ml + Operon 275291 7..2753399 note = "predicted operon" note 

^"ordered genes contained in the operon: smpB" 

lux-z.pk014.nl 9 - Operon complement(44391 15. .4439753) note = "predicted 

operon" note ^"ordered genes contained in the operon: msrA" 
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ux-z.pk014.o9 

ux-z.pk014.p3 
ux-z.pk014.p4 

ux-z.pk014.p8 

ux-z.pk015.a7 
ux-z.pk015.b23 

ux-z.pk015.c7 

ux-z.pk015.dl0 

ux-z.pk015.el7 

ux-z.pk015.fl 
ux-z.pk015.gl 

ux-z.pk015.gl3 
ux-z.pk015.g23 
ux-z.pk015.i5 

ux-z.pk015.nl 
ux-z.pk015.n24 
iux-z.pk015.n4 
ux-z.pk015.p2 

ux-z.pk015.p6 



Operon complernent(248 6043.. 2488206) note = "predicted 
operon" note - 'ordered genes contained in the operon: b2371 
b2372" 

Operon 4077307..4077549 note = "predicted operon" note 
="ordered genes contained in the operon: yiiF" 

Operon complement( 1930 139.. 1932628) note = "predicted 
operon" note -'ordered genes contained in the operon: eda 
edd" 

Operon complement(l 122630.. 1 123277) note = "predicted 
operon" note -'ordered genes contained in the operon: grxB" 
Operon 3408908. .3409204 note - "documented fis operon" 
Operon 4059826..4061091 note = "predicted operon" note 
=" ordered genes contained in the operon: yihN" 

Operon 2758568..2759194 note = "predicted operon" note 
-'ordered genes contained in the operon: b2626" 
Operon 2745907..2748081 note = "predicted operon" note 
-'ordered genes contained in the operon: b261 1 b2612 yfjD" 
Operon complement(2794358..2794807) note = "predicted 
operon" note -'ordered genes contained in the operon: 
b2665" 

Operon 2321467..2325313 note = "documented atoDAB 
operon" 

Operon 1509678.. 15 15026 note = "predicted operon" note 
-"ordered genes contained in the operon: bl440 bl441 bl442 
bl443 bl444" 

Operon 32 14420..32 15043 note = "predicted operon" note 
="ordered genes contained in the operon: b3071" 

Operon complement(2828798..2830387) note = "predicted 
operon" note - 'ordered genes contained in the operon: ygaA" 
Operon complement^ 7675 9.. 3 775 92) note = "predicted 
operon" note =" ordered genes contained in the operon: 
b0355" 

Operon complement(5863 14.. 5 87204) note = "predicted 
operon" note ="ordered genes contained in the operon: ybcH" 
Operon 25766S6..2579659 note = "predicted oppron" note 
-'ordered genes contained in the operon: b2464 tktB" 
Operon complement(4145045..4 145896) note = "predicted 
operon" note ="ordered genes contained in the operon: yijO" 
Operon complement(311 1560..31 12492) note = "predicted 
operon" note - 'ordered genes contained in the operon: 
b2972" 

Operon 1768612..1768995 note = "predicted operon" note 
="ordered genes contained in the operon: b 1 689" 
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ux-z.pk016.a4 - Operon complement(45808 19.. 4584385) note = "predicted 

operon" note ="6rdered genes contained in the operon: hsdR" 
ux-z.pk016.c22 + Operon 42T0813..421 1196 note = "predicted operon" note 

^"ordered genes contained in the operon: yjaA" 
ux-z.pk016.c9 + Operon 1073465. .1074103 note = "predicted operon" note 

="ordered genes contained in the operon: bl013" 
ux-z.pk016.el 1 - Operon complement(2812905..2814461) note = "predicted 

operon" note — 1 ordered genes contained in the operon: gshA" 

:ux-z.pk016.fl6 + Operon 2405581. .2406798 note = "predicted operon" note 

="ordered genes contained in the operon: b2290" 
ux-z.pk016.f6 + Operon 1234161. .1234880 note = "predicted operon" note 

="ordered genes contained in the operon: fadR" 
ux-z.pk016.h 16 + Operon 3054261. .3054809 note = "predicted operon" note 

— 'ordered genes contained in the operon: ygfA" 
ux-z.pk016.il7 - Operon complement(28 17403.. 2820033) note = "predicted 

operon" note ="ordered genes contained in the operon: alaS" 
ux-z.pk0 16.12 + Operon 4041737..4043209 note = "predicted operon" note 

-'ordered genes contained in the operon: yihF" 
ux-z.pk0 16.124 + Operon 1055484..1056512 note = "predicted operon" note 

="ordered genes contained in the operon: torT" 
ux-z.pk016.nl4 - Operon complement 122 8 706.. 1229623) note = "predicted 

operon" note -'ordered genes contained in the operon: 
bll82" 

ux-z.pk0 1 6.o 11 - Operon complement^ 1 08320..4 1 09087) note = "predicted 

operon" note ="ordered genes contained in the operon: tpiA" 

ux-z.pk016.ol7 + Operon 2475867..2478550 note = "documented dsdXA 

operon" 

ux~z.pk017.c2 + Operon 3550718..3553423 note = "documented malT operon" 
ux-z.pk017.gl9 + Operon 420210..421 583 note = "predicted operon" note 

- 'ordered genes contained in the operon: b0402" 

ux-z.pk017.g23 - Operon complement(4468560..4469969) note = "documented 

pyrBI operon" 

ux-z.pk017.i21 - Operon complement(3633838..3635040) note = "predicted 

operon" note =" ordered genes contained in the operon: yhiN" 

ux-z.pk017.i4 + Operon 30043 56..3005447 note = "predicted operon" note 

="ordered genes contained in the operon: b2870" 
iux-z.pk017.kl5 - Operon complement(2657583.. 2659575) note = "predicted 

operon" note ="ordered genes contained in the operon: yfhF 
b2529b2530" 

ux-z.pk017.ml8 + Operon 2270378..2275909 note = "predicted operon" note 

- 'ordered genes contained in the operon: yej A yejB yejE 
yejF" 



110 



• 



WO 01/79419 

Iux-z.pk018.al6 

Iux-z.pk018.e4 
Iux-z.pk018.gl8 

Iux-z.pk018.i2 

Iux-z.pk018.ol0 

Iux-z.pk019.al8 

hix-z.pk019.a24 

Iux-z.pk019.bl5 

Iux-z.pk019.b21 

Iux-z.pk019.c6 
Iux-z.pk019.el2 
Iux-z.pk019.e2 
lux-z.pk019.gl 

Iux-z.pk019.h5 

lux-z.pk019.jll 
Iux-z.pk019.jl5 
Iux-z.pk019.o21 
Iux-z.pk020.a23 
Iuxrz.pk020.cl6 
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- Operon complement(3738738..37392 1 1) note = "predicted 
operon" note -'ordered genes contained in the operon: not- 
yiaT 

+ Operon 42568 16..4257025 note = "predicted operon" note 
=" ordered genes contained in the operon: b4045" 
Operon complement(2488276..2489970) note = "predicted 
operon" note - 1 ordered genes contained in the operon: 
b2373" 

+ Operon 2037500..2039140 note = "predicted operon" note 
-'ordered genes contained in the operon: bl971 bl972" 

+ Operon 959463. .960251 note = "predicted operon" note 
- 'ordered genes contained in the operon: b0909" 

Operon complement(2242798.. 2244789) note = "documented 
cirA operon" 

Operon complement(4559066..4560244) note = "predicted 
operon" note - 'ordered genes contained in the operon: yjiJ" 
+ Operon 461 1 1 94..461 1 829 note = "predicted operon" note 
^"ordered genes contained in the operon: yjjV" 

Operon complement(3309474..3310819) note = "predicted 
operon" note ="ordered genes contained in the operon: truB 
rbfA" 

Operon complement(43 88035. .43 89048) note = "predicted 
operon" note ^"ordered genes contained in the operon: yjeQ" 
+ Operon 4263361. .4264440 note = "predicted operon" note 
-"ordered genes contained in the operon: air" 

+ Operon 1225823. .1226191 note = "predicted operon" note 
- 'ordered genes contained in the operon: bl 177" 

Operon complement^ 633 2 5.. 6681 51) note = "predicted 
operon" note - 'ordered genes contained in the operon: rlpA 
mrdB mrdA ybeA ybeB" 

Operon complement( 1555136. .1561100) note = "predicted 
operon" note ="ordered genes contained in the operon: bl483 
bl484 bl485 bl486 bl487 bl488" 

+ Operon 26623 83.. 2663264 note = "predicted operon" note 
-'ordered genes contained in the operon: b2534" 

+ Operon 1498597..1500179 note = "documented tehAB 
operon" 

Operon complement(3961980..3963245) note = "predicted 
operon" note - 'ordered genes contained in the operon: rhlB" 
+ Operon 3572704..3573297 note = "predicted operon" note 
-'ordered genes contained in the operon: b3434" 
Operon complement(3809518..3810192) note = "predicted 
operon" note ^"ordered genes contained in the operon: radC" 
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Iux-z.pk020.c8 
Iux-z.pk020.e3 
Iux-z.pk020.h21 
Iux-z.pk020.j20 

Iux-z.pk020.j9 
lux-z.pk020.121. 
Iux-z.pk020.nl 2 

Iux-z.pk021.al4 

Iux-z.pk021.b3 

Iux-z.pk021.cl0 

lux-z.pk021.dl 

Iux-z.pk021.d22 

Iux-z.pk021.el0 

Iux-z.pk021.gl6 

Iux-z.pk021.hl4 
Iux-z.pk021.h3 
Iux-z.pk021.i24 
Iux-z.pk021.k6 



Operon 4292060..4293373 note = "predicted operon" note 
= M ordered genes contained in the operon: gltP" 

Operon 1 67484.. 1 73444 note = "documented fhuACDB 
operon" 

Operon 40079 18.. 4008433 note = "predicted operon" note 
="ordered genes contained in the operon: yigL" 

Operon compiement(379293.. 38051 1) note = "predicted 
operon" note ="ordered genes contained in the operon: b0358 
b0359" 

Operon 2322776..2324098 note = "predicted operon" note 
-'ordered genes contained in the operon: atoE" 

Operon 1329072.. 133 1669 note = "predicted operon" note 
="ordered genes contained in the operon: topA" 

Operon complement(216179..218775) note = "predicted 
operon" note - 'ordered genes contained in the operon: yaeF 
proS" 

Operon 4277559..4279208 note = "predicted operon" note 
="ordered genes contained in the operon: yjcE" 

Operon complement(29623 83.. 2964059) note = "documented 
Igt-thyA operon" 

Operon 34079 17..3408 882 note = "predicted operon" note 

- 'ordered genes contained in the operon: yhdG" 

Operon 1687876.. 1689384 note = "predicted operon" note 

— 'ordered genes contained in the operon: bl614" 

Operon complement(4523674..4525548) note = "predicted 

operon" note -'ordered genes contained in the operon: yjhJ 

yjhKyjhL" 

Operon complement(2287085.. 2288 101) note = "predicted 
operon" note =" ordered genes contained in the operon: 
yi52__8" 

Operon complement(242 1756.. 2423 936) note = "predicted 
operon" note ~ 'ordered genes contained in the operon: hisP 
hisMhisQ" 

Operon complement(3096577..3097584) note = "predicted 

operon" note - 'ordered genes contained in the operon: yggM" 

Operon 1805 820.. 1806680 note = "predicted operon" note 

="ordered genes contained in the operon: bl725" 

Operon 3814303..3815166 note = "predicted operon" note 

^"ordered genes contained in the operon: yicC" 

Operon complement(4375389..4376522) note = "predicted 

operon" note ="ordered genes contained in the operon: ampC" 
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Operon complement(2660603..2661343) note = "predicted 
operon" note =" ordered genes contained in the operon: 
b2532" 

+ Operon 3225442..3228880 note = "predicted operon" note 
- 'ordered genes contained in the operon: ygj J ygjK" 

+ Operon 3225442.. 3228880 note = "predicted operon" note 
="ordered genes contained in the operon: ygj J ygjK" 
Operon complement(27 1 0047..27 1 0904) note = "predicted 
operon" note ="ordered genes contained in the operon: yfiC" 

+ Operon 3490205. .3491386 note - "predicted operon" note 
-'ordered genes contained in the operon: yhfC" 

+ Operon 4199504..4200901 note = "documented hydH operon" 
Operon complement(635939..636841) note - "predicted 
operon" note =" ordered genes contained in the operon: 
b0603" 

Operon complement(l 56299.. 156883) note = "predicted 
operon" note ="ordered genes contained in the operon: yadN" 
Operon complement(2432 1 02. .243276 1 ) note = "predicted 
operon" note =" ordered genes contained in the operon: dedA" 
+ Operon 2264265.. 2265731 note = "predicted operon" note 
— 'ordered genes contained in the operon: yeiQ" 

Operon coraplement(2221958..2222899) note = "predicted 
operon" note =" ordered genes contained in the operon: pbpG" 
Operon complement(3960360..3961844) note = "predicted 
operon" note - 'ordered genes contained in the operon: gppA" 
+ Operon 2 163690..2 165051 note = "predicted operon" note 
=" ordered genes contained in the operon: b2081 " 

Operon complement (2 940940. .2941 170) note = "predicted 
operon" note - ' ordered genes contained in the operon: 
b2809" 

+ Operon 2597860..2598968 note = "predicted operon" note 
=" ordered genes contained in the operon: gcvR bcp" 

Operon complement(4486 129..448763 1) note = "predicted 
operon" note ^"ordered genes contained in the operon: yjgR" 
+ Operon 4002473. .4003342 note = "predicted operon" note 
="ordered genes contained in the operon: pldA" 

+ Operon 1 277 1 80. . 1 27857 1 note = "documented narK operon" 
Operon complement(4282992..4284950) note = "predicted 
operon" note —"ordered genes contained in the operon: acs" 
+ Operon 1 194346.. 1 195596 note - "documented icdA operon" 
+ Operon 3387155..3388084 note = "predicted operon" note 
- 'ordered genes contained in the operon: b3243" 
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Operon 3418958..3420830 note = "predicted operon" note 
- 'ordered genes contained in the operon: yhdY yhdZ" 
Operon 8551 86..856778 note = "predicted operon" note 

— 'ordered genes contained in the operon: b0820" 
Operon complement(2957082..2962199) note = "predicted 
operon" note ="ordered genes contained in the operon: recC 
ppdC ygdB ppdB ppdA". 

Operon complement(3436342.. 3437146) note = "predicted 
operon" note ^"ordered genes contained in the operon: yhdM 
yhdN" 

Operon complement(3239467.. 3242381) note = "documented 
uxaCA operon" 

Operon complement 1 690914.. 1694095) note = "predicted 
operon" note - 'ordered genes contained in the operon: uidB 
uidA" 

Operon complement(l 928481 ..1928771) note = "predicted 
operon" note ="ordered genes contained in the operon: yebG" 

Operon 3963846..3965291 note = "predicted operon" note 
^"ordered genes contained in the operon: rhoL rho" 

Operon 1426547.. 1427008 note - "predicted operon" note 
=" ordered genes contained in the operon: bl371" 

Operon complement^ 12705 8. .3 128230) note = "predicted 
operon" note =" ordered genes contained in the operon: 
b2981" 

Operon 4380191..4381 198 note = "predicted operon" note 

— 'ordered genes contained in the operon: yjeA" 

Operon 22632 15. .2264042 note = "predicted operon" note 

— 'ordered genes contained in the operon: yeiP" 

Operon complement(l 1 86342.. 1 187472) note = "predicted 
operon" note - 'ordered genes contained in the operon: ycfD" 

Operon complement^! 349852.. 1355134) note = "predicted 
operon" note - 'ordered genes contained in the operon: sapF 
sapD sapC sapB sapA" 

Operon 1146017.. 1146538 note = "predicted operon" note 

="ordered genes contained in the operon: yceD" 

Operon complement(2549297..2550269) note = "predicted 

operon" note - 'ordered genes contained in the operon: b2433 

b2434" 

Operon 1391230..1392864 note = "predicted operon" note 
— 'ordered genes contained in the operon: bl 329" 
Operon complement(4487709. .4488707) note = "predicted 
operon" note - 'ordered genes contained in the operon: yjgS" 
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lux-z.pk025.jll 
Iux-z.pk025.jl6 
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Iux-z.pk025.m6 

Iux-z.pk025.m7 
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Iux-z.pk014.h24 

iux-z.pk014.18 

Iux-z.pk020.gl5 
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lux-t.pk001.al 

Iux-t.pk001.a2 
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lux-a.pk068.cl 

Iux-tpk001.a4 
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Iux-t.pk001.a9 
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Operon complement(l 73 1 778.. 1 7321 25) note = "predicted 
operon" note -'ordered genes contained in the operon: ydhD" 
Operon 4098391 ..409901 1 note - "documented sodA operon" 
Operon complement(1261249.. 1262723) note = "predicted 
operon" note ="ordered genes contained in the operon: ychB 
hemM" 

Operon complement(529356..530450) note - "predicted 

operon" note ="ordered genes contained in the operon: ybbB" 

Operon 1712401. .1713006 note = "predicted operon" note 

— 'ordered genes contained in the operon: gst" 

Operon 3578769..3579257 note = "predicted operon" note 

="ordered genes contained in the operon: b3441 " 

Operon complement(l 58 1786.. 158 1983) note = "predicted 

operon" note =" ordered genes contained in the operon: 

bl500" 

Operon 3 775026.. 3776681 note = "documented lldP operon" 

Operon 3057773. .3058666 note = "predicted operon" note 

^"ordered genes contained in the operon: iciA" 

Operon 402927.. 404042 note = "predicted operon" note 

="ordered genes contained in the operon: yaiC" 

surA 

yacK 

map 

betT 

yaiB 

proC 

bioA 

osmY 

inaA 

hisPl 

hisP3 

katG 

yebF 

flhB 

sgcRCyjhJ) 

flhC 

recA 
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Table 20 

1st gene of the operon 45 minutes 90 minutes 135 minutes 
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CLAIMS 

What is claimed is: 

1 . A method for determining gene function between at least two genome-registered 
collections comprising: 

(a) assembling at least two genome-wide scale, genome-registered collections; 

(b) perturbing each collection from (a) with at least one perturbation; 

(c) measuring the response of each collection to each perturbation of (b); 

(d) analyzing the results of the at least one perturbation to identify patterns of 
similarities and differences between the at least two genome-registered 
collections. 

2. A method according to Claim 1 wherein the perturbation is selected from the 
group consisting of radiation, humidity, alterations in temperature, alterations in carbon 
source, alterations in energy source, alterations in nitrogen source, alterations in phosphorus 
source, alterations in sulfur source, alterations in trace element sources, a change in pH, the 
presence other organisms, the presence of chemicals, the presence of toxins, and abnormal 
levels of normal metabolites. 

3. A method for generating a genome-registered collection of reporter gene fusions 
comprising the steps of: 

(a) generating a set of gene fusions comprising: 

1) a reporter gene or reporter gene complex operably linked to 

2) a genomic fragment from an organism of which at least 15% of the 
genomic nucleotide sequence is known; 

(b) introducing in vitro the reporter gene fusions from step (a) into a host 
organism; 

(c) registering the reporter gene fusions on the basis of sequence homology to 
the genomic sequence of the organism; 

. (d) repeating (a), (b), and/or (c) until reporter gene fusions have been made to 
at least 1 5% of the known genomic nucleotide sequence of said organism. 

4. A method according to Claim 3 wherein the gene fusions of step (a) are 
generated either in vivo or in vitro. 

5. A method for generating a genome-registered collection of reporter gene fusions 
comprising: 

(a) generating random nucleic acid fragments from the DNA of an organism 
of which at least 15% of the nucleotide sequence is known; 

(b) operably linking the random nucleic acid fragments generated in (a) to a 
vector containing a promoterless reporter gene or reporter gene complex; 

(c) introducing the vector (b) containing the gene fusions into a host organism; 
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(d) determining the nucleic acid sequence of the distal and the proximal ends 
of the random nucleic fragments relative to the reporter gene or reporter 
gene complex; 

(e) registering the sequenced fusions of step (d) on the basis of sequence 
5 homology to the genomic sequence of the host organism; 

(d) repeating (a), (b), and/or (c) until reporter gene fusions have been made to 
at least 15% of the known genomic nucleotide sequence of said organism 

6. A method according to Claim 5 wherein the random nucleic acid fragments of 
step (a) are generated by method selected from the group consisting of restriction enzyme 

10 digestion, physical shearing of the genome and polymerase chain reaction. 

7. A method for generating a genome-registered collection of reporter gene fusions 
comprising: 

(a) providing a genome from an organism wherein at least 1 5% of the 
nucleotide sequence is known; 
15 (b) providing a series of amplification primers having homology to specific 

known regions of the genome of (a) ; 

(c) amplifying portions of the genome of (a) with the primers of (b) to create a 
collection of nucleic acid amplification products; 

(d) operably linking the amplification products of (c) to a vector containing a 
20 promoterless reporter gene or reporter gene complex; 

(e) introducing the reporter gene fusions into a said organism; 

(f) repeating (a) - (e) until, until reporter gene fusions have been made to at 
least 15% of the known genomic nucleotide sequence of said organism. 

8. A method for generating a genome-registered collection of reporter gene fusions 
25 comprising steps of: 

(a) introducing one or more transposons into the genome of an organism of 
which at least 15% of the nucleotide sequence is known, each transposon 
containing a promoterless reporter gene or reporter gene complex; 

(b) determining the nucleic acid sequence of the junction between the proximal 
30 end of the genomic DNA and the transposon containing the reporter gene 

or reporter gene complex and registering the reporter gene fusions relative 
to the genomic sequence of the organism, . 

(c) repeating (a) and (b) until reporter gene fusions have been made to at least 
15% of the known genomic nucleotide sequence of said organism 

35 9. A method according to any one of Claims 1, 3, 5, 7 or 8 wherein organism is 

selected from the group consisting of prokaryotes and fungi. 

10. A method according to Claim 9 wherein the prokaryote is an enteric bacterium. 
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11. A method according to Claim 1 0 wherein the enteric bacterium is selected from 
the group consisting of Escherichia and Salmonella. 

12. A method according to one of Claims 1,3,5, 7 or 8 wherein the reporter gene or 
reporter gene complex is selected from the group consisting of hixCDABE> lacZ, gfp, cat, 

5 galK, inaZ, luc, luxAB, bgaB, nptll.phoA, uidA and xylE. 

13: A method according to one of Claims 1, 3, 5, 7 or 8 wherein at least 50% of the 
genomic nucleotides sequence is known. 

14. A method for identifying a profile of inducing conditions for a reporter gene 
fusion comprising: 

10 (a) obtaining a gene expression profile of an organism under induced and non- 

induced conditions wherein induced genes are identified; 

(b) providing a genome-registered collection of reporter gene fusions, said 
fusions registered to the genome of the organism of (a); 

(c) selecting the reporter gene fusions of (b) that correspond to the induced 
15 genes of (a) to create a subset of the genome-register collection; 

(d) contacting the subset of the genome-register collection of (c) with the 
inducing conditions of (a) to identify at least one representative reporter 
gene fusion whose expression was altered in a similar manner as in (a); 

(e) contacting the at least one representative reporter gene fusion of (d) in a 

20 high throughput manner with a multiplicity of different inducing conditions 

to identify a profile of inducing conditions for that reporter gene fusion. 

15. A method according to Claim 14 wherein at least 15% of the genomic nucleotide 
sequence of said organism is known. 

16. A method for analyzing a gene expression-based high throughput assay 
25 comprising: 

(a) analyzing a genome-wide gene expression assay of a microbial culture with 
a genotype of interest relative to that of a congenic strain to identify genes 
whose expression differs as a function of the allele; 

(b) selecting corresponding reporter gene fusions containing promoter regions 
30 operably linked to the genes with altered expression from (a) or co- 
regulated reporter gene fusions containing promoter regions operably 
linked to the genes co-regulated with those displaying altered expression 
from (a) from the genome-registered collection of reporter gene fusion; 

(c) testing expression of the reporter gene fusions selected from (b) by 
35 introducing the reporter gene fusions into the congenic strains; 

(d) choosing at least one representative reporter gene fusion whose expression 
differed in the two strains tested in (c); 
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(e) using the reporter gene fusion as a high throughput screen to find other 
chemicals or genes that alter expression of the fusion. 

17. A method to validate results from comprehensive genome analysis comprising 
the steps of: 

5 (a) analyzing a genome- wide, gene expression assay of an organism treated 

with a condition or chemical of interest to identify genes with altered 
expression; 

(b) selecting from a genome-registered collection of reporter gene fusions 
those reporter gene fusions containing promoter regions operably linked to 

10 genes corresponding to the altered genes from (a) or genes co-regulated 

with genes corresponding to the altered genes from (a); 

(c) testing expression of the reporter gene fusions selected from (b) with the 
conditions or chemicals of interest used in (a); and 

(d) comparing the gene expression results from (c) to the gene expression 
15 result of (a). 

18. A method to determine operon structure comprising steps of: 

(a) selecting- a subset of reporter gene fusions from a genome-registered 
collection of reporter gene fusions that map to the region of a possible 
operon; 

20 (b) assaying the subset for the reporter gene function; and 

(c) determining a putative operon structure based on the quantities of reporter 
gene function. 

19. A method for constructing a cellular array containing reporter gene fusions 
comprising: 

25 (a) generating a set of gene fusions comprising: 

1) a reporter gene or reporter gene complex operably linked to 

2) a genomic fragment from an organism of which at least 1 5% of the 
genomic nucleotide sequence is known; 

(b) selecting a non-redundant subset of reporter gene fusions from the set of 
30 (a) representative of at least 15% of known or suspected promoter regions 

from a genome-registered collection of reporter gene fusions,, each 
containing a known or suspected promoter region operably linked to a 
reporter gene or reporter gene complex; and 

(c) fixing the non-redundant subset of reporter gene fusions of (b) in an array 
35 format. 

20. A method for measuring gene expression responses to perturbation comprising: 
(a) constructing at least 2 identical cellular arrays, each cellular array 

comprising a reporter gene fusion comprising: 
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1) a reporter gene or reporter gene complex operably linked to 

2) a genomic fragment from an organism of which at least 1 5% of the 
genomic nucleotide sequence is known; 

wherein at least one cellular array is a control array and at least one cellular array 
is an experimental array; 

(b) contacting the experimental array of (a) with a perturbing condition; 

(c) comparing the differences between the gene expression activity of the 
control and the experimental array wherein gene expression response to a 
perturbing condition is determined. 

21 . The method of Claim 20 wherein the cellular array is fixed in a manner 
selected from the group consisting of, fixed on a solid medium, and arrayed in liquid 
medium. 

22. The method of Claim 20 wherein the perturbing condition is selected from the 
group consisting of radiation, humidity, alterations in temperature, alterations in carbon 
source, alterations in energy source, alterations in nitrogen source, alterations in phosphorus 
source, alterations in sulfur source, alterations in trace element sources, a change in pH, the 
presence other organisms, the presence of chemicals, the presence of toxins, and abnormal 
levels of normal metabolites. 
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